ABSTRACT:

The hepatitis C virus (HCV) cell culture system according to the invention consists of human
hepatoma cells, which are transfected with a HCV-RNA construct, that comprises the HCV
specific RNA segments 5' NTR, NS3, NS4A, NS4B, NS5A, NS5B, and 3' NTR as well as a minimum
of one marker gene for selection (selection gene).
Hepatitis C Virus Cell Culture System



CLAIMS

1. A hepatitis C virus (HCV) cell culture system, which
comprises mainly eukaryotic cells containing
transfected HCV specific genetic material,
characterized in that,

the eukaryotic cells are human hepatoma cells and the
transfected HCV specific genetic material is a HCV-RNA
construct, which comprises the HCV specific RNA
segments 5' NTR, NS3, NS4A, NS4B, NS5A, NS5B, and
3' NTR as well as an additional marker gene for
selection (selection gene).

2. A cell culture system according to Claim 1,
characterized in that, 

the hepatoma cells are derived from a usual hepatoma 
cell line. 

3. A cell culture system according to Claim 1,
characterized in that, 

the hepatoma cells are extracted from a primary 
hepatoma cell culture. 

4. A HCV-RNA construct, characterized in that,
it comprises the HCV specific RNA segments 5' NTR,
NS3, NS4A, NS4B, NS5A, NS5B, and 3' NTR as well as an
additional marker gene for selection (selection gene).

5. A HCV-RNA construct according to Claim 4,
characterized in that,

it comprises one of the nucleotide sequences
illustrated in the sequence protocols SEQ ID NO: 1 to
SEQ ID NO: 11.

6. A HCV-RNA construct according to Claim 4,
characterized in that,

the 3' NTR has a nucleotide sequence, which has been
selected from the group of nucleotide sequences (a) to 
(i) listed in the following: 

(a) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTA 

GCTTTTTTTTTTTTCTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGCC
CAGACTGGAGT 

(b) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTA

GTCTTTTTTTTTTCTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGCCCA 
GACTGGAGC 

(c) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTAAT

CTTTTTTTTTTTCTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGCCCA 
GACTGGAGC

(d) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTT

AGTCTTTTTTTTTTTCTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGC 
CCAGACTGGAGT 

(e) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTA

GTCTTTTTTTTTTTTCTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGC
CCAGACTGGAGT

(f) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTA

GTCTTTTTTTTTTTTCTTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTG 
CCCAGACTGGAGT

(g) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTA

GTCTTTTTTTTTTTCTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGCC 
CAGACTGGAGT 

(h) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTT 

TAATCTTTTTTTTTTTTTTCCTTTTTTTGAGAGAGAGAGTCTCACTCTGT 
TGCCCAGACTGGAGT 
(i) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTTTTTT
AATCTTTTTTTTTTTTTTCTTTTTTTTTTGAGAGAGAGAGTCTCACTCTG
TTGCCCAGACTGGAGT

7. A HCV-RNA construct according to one of the Claims 4 
to 6, characterized in that, 

the marker gene for selection is a resistance gene and 
preferably an antibiotic resistance gene. 

8. A HCV-RNA construct according to one of the Claims 4 
to 6, characterized in that, 

the marker gene for selection is a neomycin 
phosphotransferase gene.

9. A HCV-RNA construct according to one of the Claims 4 
to 8, characterized in that, 

the marker gene for selection is integrated in the 
HCV-RNA downstream from the 5' NTR. 

10. A HCV-RNA construct according to one of the Claims 4
to 8, characterized in that,

the marker gene for selection is linked to the HCV-RNA 
via a ribozyme or a recognition sequence for a 
ribozyme. 

11. A HCV-RNA construct according to one of the Claims 4 
to 10, characterized in that, 

it has an integrated reporter gene. 

12. A HCV-RNA construct according to Claim 11, 
characterized in that, 

the reporter gene is a gene from the group of the 
luciferase genes, the CAT gene (chloramphenicol acetyl 
transferase gene), the lacZ gene (beta galactosidase 
gene), the GFP gene (green fluorescence protein gene),
the GUS gene (glucuronidase gene) and the SEAP gene 
(secreted alkaline phosphatase gene). 
13. A HCV-RNA construct according to one of the Claims 4
to 11, characterized in that,

its replication has an influence on the expression
of a (cellular) surrogate marker gene.

14. A HCV-RNA construct according to one of the Claims 11
to 13, characterized in that,

the resistance gene is cloned into the open reading
frame of the HCV-RNA in such a way that it will only
be converted into an active form after proteolytic
processing.

15. A HCV-RNA construct according to one of the Claims 11
to 14, characterized in that,

the reporter gene and the marker gene for selection 
are arranged in the construct in such a way, that they 
are expressed as a fusion protein. 

16. A cell culture system according to one of the Claims 1
to 3, characterized in that, 

the HCV-RNA construct is a construct according to at 
least one of the Claims 4 to 15. 

17. A cell culture system according to Claim 1,
characterized in that, 

the cells containing the HCV-RNA construct are lodged 
at the DSMZ, Braunschweig, Germany under the deposit 
number DSM ACC2394 (laboratory name HuBl 9-13). 

18. A use of a cell culture system according to one of the
Claims 1 to 3 or 16 to 17 and/or a HCV-RNA construct
according to one of the Claims 4 to 15 for the
production and/or evaluation and/or testing of
treatments and/or diagnostics for the treatment of HCV
infections in particular.
19. A use of a cell culture system according to one of the
Claims 1 to 3 or 16 to 17 and/or a HCV-RNA construct 
according to one of the Claims 4 to 15 for the 
production of a vaccine against HCV infections. 

20. A use of a HCV-RNA construct according to one of the
Claims 4 to 15 for the production of a liver cell 
specific vector in gene therapy. 

21. A HCV-RNA construct according to one of the Claims 4 
to 15, characterized in that, 

it has an integrated foreign gene and can be used to
transfect this foreign gene into a target cell, which
can be used to express this foreign gene.

22. A process for the production of cell culture adapted
mutants of a HCV-RNA construct according to one of the 
claims 4 to 15, in which the mutants have increased 
replication efficiency compared to the HCV-RNA 
construct, 

characterized in that, 

a cell culture system according to claim 1, in which
the transfected HCV specific genetic material is a 
HCV-RNA construct with a selection gene according to 
one of the claims 4 to 15, is cultivated on/in the 
selection medium corresponding to the selection gene, 
that the cell clones created are collected, and that
the HCV-RNA constructs or parts thereof are isolated 
from these cell clones. 

23. A process according to claim 22, characterized in
that, 

the isolated HCV-RNA constructs are passaged at least 
one more time. That is, they are transfected in cells 
of a cell culture system according to claim 1. The 
cell culture system of claim 1 thereby obtained, in 
which the transfected HCV specific genetic material is 
the isolated HCV-RNA construct with a selection gene,
is cultivated on/in the selection medium corresponding 
to the selection gene, the cell clones created are 
collected, and the HCV-RNA constructs are isolated 
from these cell clones. 

24. A process for the production of mutants of a HCV full- 
length genome or of a HCV subgenome or any HCV 
construct with increased replication efficiency in 
comparison to the original HCV full-length genome or 
subgenome or HCV-RNA construct, 
characterized in that, 

a cell culture adapted mutant of a HCV-RNA construct 
is produced and isolated by a process according to 
claim 22 or 23,

the nucleotide and amino acid sequences of these
mutants are determined and the type, number and
positions of the nucleotide and amino acid mutations
are determined by comparison with the nucleotide and
amino acid sequence of the original HCV-RNA construct,
and these mutations are introduced in an (isolated) 
HCV full-length genome or a HCV subgenome or any HCV- 
RNA construct, either by site-directed mutagenesis or 
by exchange of fragments containing the relevant 
mutations.

25. A cell culture adapted HCV-RNA construct with high
replication efficiency, 
characterized in that, 

it is derivable from a HCV-RNA construct according to 
one of the claims 4 to 15 through nucleotide and/or 
amino acid mutations and that it is obtainable with a 
process according to one of the claims 22 to 24. 

26. A cell culture adapted HCV-RNA construct according to
claim 25, 

characterized in that, 
it carries one or several of the following amino acid
exchanges: 1283 arg -> gly and/or 1383 glu -> ala 
and/or 1577 lys -> arg and/or 1609 lys -> glu and/or 
1936 pro -> ser and/or 2163 glu -> gly and/or 2330 lys 
-> glu and/or 2442 ile -> val. 

27. A cell culture adapted HCV-RNA construct according to
claim 25 or 26, 
characterized in that, 

it displays one or several of the nucleotide and/or 
amino acid exchanges shown in Table 3, where Table 3 
is an integral part of this claim. 



28. Cell culture adapted mutant of a HCV-RNA construct or
of a HCV full-length genome with increased replication 
efficiency in comparison to the original HCV-RNA 
construct or original HCV full-length genome, 
characterized in that, 

it is obtainable with a process in which the number 
and type of mutations in a cell culture adapted HCV- 
RNA construct according to claim 24 are determined 
through sequence analysis and sequence comparison, and 
these mutations are introduced in a HCV-RNA construct, 
particularly in a HCV-RNA construct according to one 
of the claims 4 to 15, or in an (isolated) HCV-RNA 
full-length genome, either by targeted mutagenesis or 
by exchange of sequence sections containing the 
relevant mutations. 

29. Hepatitis C virus particles or virus-like particles,
characterized in that, 

they are obtainable by a process according to one of 
the claims 22-24. 

30. Cells infected with Hepatitis C virus particles or
virus-like particles, according to claim 29. 
Hepatitis C Virus Cell Culture System

DESCRIPTION 

The invention relates to a hepatitis C virus (HCV) cell
culture system, which comprises mainly eukaryotic cells
containing transfected HCV specific genetic material, which
means they are transfected with HCV specific genetic
material.

The hepatitis C virus (HCV) is one of the main causes
worldwide of chronic and sporadic liver diseases. The
course of most HCV infections does not involve any obvious
clinical signs, but 80 - 90 % of the infected people become
chronic carriers of the virus and 50 % of these chronic
carriers develop chronic hepatitis with different degrees
of severity. Approx. 20 % of the chronically infected
develop a cirrhosis of the liver over 10 to 20 years, on
the basis of which a primary hepatocellular carcinoma can
develop. Nowadays chronic hepatitis C is the main
indication for liver transplantation. A specific therapy
does not exist until now. The only therapy currently
available is high-dose administration of Interferon alpha
or a combination of Interferon alpha and the purine
nucleoside analogue Ribavirin. However, only approx. 60 %
of all treated persons respond to this therapy and with
these, a new viraemia occurs in more than half of all cases
after the discontinuation of the treatment.

Due to the high prevalence, especially in industrialized 
countries, the serious effects of chronic infections and 
the non-existence of a specific therapy, the development of 
a HCV specific chemotherapy is an important goal of 
pharmaceutical research and development. The main problem 
lies in the previous lack of a suitable cell culture 
system, which enables the study of virus replication and 
pathogenesis in eukaryotic cells. 
Due to the small amount of virus in blood or tissue, the
lack of suitable cell culture systems or animal models (the
chimpanzee is still the only possible experimental animal)
as well as the lack of efficient systems for producing
virus-like particles, it has not been possible up to now to
analyze the molecular composition of the HCV particle in
depth and to resolve it. The information currently available
can be summarized as follows: HCV is an enveloped
plus-strand RNA virus with a particle diameter of 50 - 60 nm
and a medium density of 1.03 - 1.1 g/ml. It was molecularly
cloned and characterized for the first time in 1989 (Choo
et al., 1989, Science, 244, 359 - 362). The HCV-RNA has a
length of approx. 9.6 kb (= 9600 nucleotides), a positive
polarity and comprises one open reading frame (ORF), which
encodes a linear polyprotein of approx. 3010 amino acids
(see Rice 1996, in Virology, B. N. Fields, D. M. Knipe, P.
M. Howley, Eds. (Lippincott-Raven, Philadelphia, PA, 1996),
vol. 1, pp. 931 - 960; Clarke 1997, J. Gen. Virol. 78,
2397; and Bartenschlager 1997, Intervirology 40, 378 and
see Fig. 1 A). During the replication of the virus the
polyprotein is cleaved into the mature and functionally
active proteins by cellular and viral proteases.

Within the polyprotein the proteins are arranged as follows
(from the amino to the carboxy terminus):
Core-E1-E2-p7-NS2-NS3-NS4A-NS4B-NS5A-NS5B. The core protein
is the main component of the nucleocapsid. The glycoproteins
E1 and E2 are transmembrane proteins and the main components
of the viral envelope. They probably play an important role
during the attachment of the virus to the host cell. These
three proteins core, E1, and E2 constitute the viral
particle and are therefore called structural proteins. The
function of the protein p7 is still not clear. The protein
NS2 is probably the catalytic domain of the NS2-3 protease,
which is responsible for the processing between the
proteins NS2 and NS3. The protein NS3 has two functions, one
is a protease activity in the amino terminal domain, which
is essential for the polyprotein processing, and the other a
NTPase/helicase function in the carboxy terminal domain,
which is probably important during the replication of the
viral RNA. The protein NS4A is a co-factor of the NS3
protease. The function of the protein NS4B is unknown.
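The arrangement described above can be written down as a small data structure. The following Python sketch is illustrative only: the names POLYPROTEIN_ORDER, STRUCTURAL and CONSTRUCT_PROTEINS are hypothetical, and the list of proteins encoded by the construct is simply taken from the segments NS3 to NS5B named in the abstract and the claims.

```python
# Order of the mature proteins within the HCV polyprotein,
# from the amino terminus to the carboxy terminus, as stated in the text.
POLYPROTEIN_ORDER = ["Core", "E1", "E2", "p7", "NS2",
                     "NS3", "NS4A", "NS4B", "NS5A", "NS5B"]

# Proteins that constitute the viral particle (structural proteins).
STRUCTURAL = {"Core", "E1", "E2"}

# Coding segments named for the HCV-RNA construct (NS3 to NS5B).
CONSTRUCT_PROTEINS = ["NS3", "NS4A", "NS4B", "NS5A", "NS5B"]


def proteins_absent_from_construct():
    """Return the polyprotein members not encoded by the construct, in order."""
    return [p for p in POLYPROTEIN_ORDER if p not in CONSTRUCT_PROTEINS]


def is_contiguous_block(subset, full=POLYPROTEIN_ORDER):
    """Check that subset appears in full as one uninterrupted run."""
    start = full.index(subset[0])
    return full[start:start + len(subset)] == list(subset)


if __name__ == "__main__":
    print("absent from construct:", proteins_absent_from_construct())
    print("construct block is contiguous:", is_contiguous_block(CONSTRUCT_PROTEINS))
```

Under these assumptions the sketch shows that all three structural proteins, together with p7 and NS2, lie outside the block NS3 to NS5B, and that this block is the uninterrupted carboxy terminal part of the polyprotein.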

The open reading frame is flanked on its 5' end by a
non-translated region (NTR) approx. 340 nucleotides in
length, which functions as the internal ribosome entry site
(IRES), and on its 3' end by a NTR approx. 230 nucleotides
in length, which is most likely important for the genome
replication. A 3' NTR such as this is the object of patent
application PCT/US 96/14033. The structural proteins in the
amino terminal quarter of the polyprotein are cleaved by
host cell signal peptidase. The non-structural proteins
(NS) 2 to (NS) 5B are processed by two viral enzymes,
namely the NS2-3 and the NS3/4A protease. The NS3/4A
protease is required for all cleavages beyond the carboxy
terminus of NS3. The function of NS4B is unknown. NS5A, a
highly phosphorylated protein, seems to be responsible for
the Interferon resistance of various HCV genotypes (see
Enomoto et al. 1995, J. Clin. Invest. 96, 224; Enomoto et
al. 1996, N. Engl. J. Med. 334, 77; Gale Jr. et al. 1997,
Virology 230, 217; Kaneko et al. 1994, Biochem. Biophys.
Res. Commun. 205, 320; Reed et al., 1997, J. Virol. 71,
7187), and NS5B has been identified as the RNA-dependent
RNA polymerase.
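The approximate figures given above for the genome can be checked against each other with simple arithmetic. The following Python sketch is only a plausibility check under stated assumptions: the constant names are hypothetical, all lengths are the approximate values quoted in the text, and the single stop codon closing the open reading frame is an assumption added for illustration.

```python
# Approximate figures quoted in the description.
FIVE_PRIME_NTR_NT = 340    # approx. length of the 5' NTR in nucleotides
THREE_PRIME_NTR_NT = 230   # approx. length of the 3' NTR in nucleotides
POLYPROTEIN_AA = 3010      # approx. length of the polyprotein in amino acids

# Assumption for illustration: one stop codon terminates the ORF.
STOP_CODON_NT = 3


def estimated_genome_length_nt() -> int:
    """Sum of 5' NTR, ORF (three nucleotides per codon) and 3' NTR."""
    orf_nt = POLYPROTEIN_AA * 3 + STOP_CODON_NT
    return FIVE_PRIME_NTR_NT + orf_nt + THREE_PRIME_NTR_NT


if __name__ == "__main__":
    total = estimated_genome_length_nt()
    print(f"estimated genome length: {total} nt (about {total / 1000:.1f} kb)")
```

With these values the estimate agrees with the stated genome length of approx. 9.6 kb, so the quoted lengths of the two NTRs and of the polyprotein are mutually consistent.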

First diagnostic systems have been developed from these 
findings, which are either based on the detection of HCV 
specific antibodies in patient serum or the detection of 
HCV specific RNA using the reverse transcription polymerase 
chain reaction (RT-PCR), and which are (must be) routinely
used with all blood and blood products and/or according to 
the regulations. 
Since the first description of the genome in 1989 several
partial and complete sequences of the HCV have been cloned
and characterized using the PCR method. A comparison of
these sequences shows a high variability of the viral
genome in particular in the area of the NS5B gene, which
eventually resulted in the classification of 6 genotypes,
which are again subdivided into the subtypes a, b, and c.

The genomic variance is not evenly distributed over the
genome. The 5' NTR and parts of the 3' NTR are highly
conserved, while certain encoded sequences vary a lot, in
particular the envelope proteins E1 and E2.

The cloned and characterized partial and complete sequences
of the HCV genome have also been analyzed with regard to
appropriate targets for a prospective antiviral therapy. In
the course of this, three viral enzymes have been
discovered, which may provide a possible target. These
include (1) the NS3/4A protease complex, (2) the NS3
helicase and (3) the NS5B RNA-dependent RNA polymerase. The
NS3/4A protease complex and the NS3 helicase have already
been crystallized and their three-dimensional structure
determined (Kim et al., 1996, Cell, 87, 343; Yem et al.,
1998, Protein Science, 7, 837; Love et al., 1996, Cell, 87,
311; Kim et al., 1998, Structure, 6, 89; Yao et al., 1997,
Nature Structural Biology, 4, 463; Cho et al., 1998, J.
Biol. Chem., 273, 15045). This has not been successful until
now with the NS5B RNA-dependent RNA polymerase.

Even though important targets for the development of a 
therapy for chronic HCV infection have been defined with 
these enzymes and even though a worldwide intensive search 
for suitable inhibitors is ongoing with the aid of rational 
drug design as well as high throughput screening, the 
development of a therapy has one major deficiency, namely 
the lack of cell culture systems or simple animal models, 
which allow direct, reliable identification of HCV-RNA or 
HCV antigens with simple methods which are common in the
laboratory. The lack of these cell culture systems is also 
the main reason that to date the comprehension of HCV 
replication is still incomplete and mainly hypothetical. 

Although according to the experts, a close evolutionary
relationship exists between HCV and the flavi- and
pestiviruses, and self-replicating RNAs have been described
for these, which can be used for the replication in
different cell lines with a relatively high yield (see
Khromykh et al., 1997, J. Virol. 71, 1497; Behrens et al.,
1998, J. Virol. 72, 2364; Moser et al., 1998, J. Virol. 72,
5318), similar experiments with HCV have not been
successful to date.

Although it is known from different publications that cell
lines or primary cell cultures can be infected with high
titre patient serum containing HCV (Lanford et al. 1994,
Virology 202, 606; Shimizu et al. 1993, Proceedings of the
National Academy of Sciences, USA, 90, 6037 - 6041;
Mizutani et al. 1996, Journal of Virology, 70, 7219 - 7223;
M. Ikeda et al. 1998, Virus Res. 56, 157; Fournier et al.
1998, J. Gen. Virol. 79, 2376 and bibliographical
references quoted therein; Ito et al. 1996, Journal of
General Virology, 77, 1043 - 1054), these virus-infected
cell lines or cell cultures do not allow the direct
detection of HCV-RNA or HCV antigens. The viral RNA in
these cells cannot be detected in a Northern Blot (a
standard method for the quantitative detection of RNA), nor
the viral protein in a Western Blot or with
immunoprecipitation. It has only been possible to detect
HCV replication with very costly and indirect methods.
These disadvantageous facts show that the replication in
these known virus-infected cell lines or cell cultures is
obviously completely insufficient.
Furthermore it is known from the publications of Yoo et al.
(1995, Journal of Virology, 69, 32 - 38) and of Dash et
al. (1997, American Journal of Pathology, 151, 363 - 373)
that hepatoma cell lines can be transfected with synthetic
HCV-RNA, which is obtained through in vitro transcription
of the cloned HCV genome. In both publications the authors
started from the basic idea that the viral HCV genome is a
plus-strand RNA functioning directly as mRNA after being
transfected into the cell, permitting the synthesis of
viral proteins in the course of the translation process,
so that new HCV particles are (could be) formed. This viral
replication, which means these newly formed HCV viruses and
their RNA, has been detected through RT-PCR. However the
published results of the RT-PCR carried out indicate that
the HCV replication in the described HCV transfected
hepatoma cells is not particularly efficient and is not
sufficient to measure qualitatively, let alone
quantitatively, the fluctuations in the replication rate
after a targeted action with prospective antiviral
treatments. Furthermore it is prior art (Yanagi et al.,
Proc. Natl. Acad. Sci. USA, 96, 2291-95, 1999) that the
highly conserved 3' NTR is essential for the virus
replication. This knowledge strictly contradicts the
statements of Yoo et al. and Dash et al., who used for their
experiments only HCV genomes with shorter 3' NTRs since
they did not know the authentic 3' end of the HCV genome.

The object of the present invention is to provide a HCV
cell culture system, where the viral RNA self-replicates in
the transfected cells with such a high efficiency that the
quality and quantity of the fluctuations in the replication
rate can be measured with common methodologies usually
found in the laboratory after a targeted action with virus
specific and, in particular, prospective HCV specific
antivirals.

The solution to this problem is to provide a cell culture
system of the aforementioned type, where the eukaryotic
cells are human cells, in particular hepatoma cells, which
are preferably derived from a normal hepatoma cell line,
but can also be obtained from an appropriate primary cell
culture, and where the transfected HCV specific genetic
material is a HCV-RNA construct, which essentially
comprises the HCV specific RNA segments 5' NTR, NS3, NS4A,
NS4B, NS5A, NS5B, and 3' NTR preferably in the order
mentioned as well as a minimum of one marker gene for
selection (selection gene).

Here and in the following "NTR" stands for "non-translated
region" and is a known and familiar term or abbreviation to
the relevant expert.

Here and in the following the term "HCV-RNA construct"
comprises constructs, which include the complete HCV
genome, as well as those, which only include a part of it,
which means a HCV subgenome.

A preferred variation of the cell culture system according
to the invention, which has proven to be worthwhile in
practice, is lodged at the DSMZ, Deutsche Sammlung von
Mikroorganismen und Zellkulturen GmbH (German Collection of
Microorganisms and Cell Cultures) in Braunschweig, Germany
under the number DSM ACC2394 (laboratory name HuBl 9-13).

With the cell culture system according to the invention an
in vitro system is provided for the first time, where
HCV-RNA is self-replicated and expressed intracellularly and
in a sufficient amount, so that the amounts of HCV-RNA as
well as of the HCV specific proteins can be determined with
conventional and reliably precise biochemical measuring
methods. This means an almost authentic cell-based HCV
replication system is available for the first time, which
is urgently needed for the development and testing of
antiviral drugs. This test system provides the possibility
of identifying potential targets for an effective HCV
specific therapy and developing and evaluating HCV specific
chemotherapeuticals.

The invention is based on the surprising finding that
efficient replication of the HCV-RNA only occurs in cells
if they have been transfected with an HCV-RNA construct,
which comprises at least the 5' and the 3' non-translated
regions (NTR) and the non-structural proteins (NS) 3 to 5B
and additionally a marker gene for selection (selection
gene). The structural genes are obviously without great
importance for replication, whereas efficient replication
of the HCV-RNA apparently only occurs if the transfected
cells are subject to permanent selection pressure, which is
imparted by the marker gene for selection (selection gene)
linked to the HCV-RNA. Consequently the marker gene
(selection gene) seems on the one hand to provoke the
selection of those cells, where the HCV-RNA replicates
productively, and it seems on the other hand to considerably
increase the efficiency of the RNA replication.

An object of the invention is also a cell-free HCV-RNA 
construct, characterized in that it comprises the HCV 
specific RNA segments 5' NTR, NS3, NS4A, NS4B, NS5A, NS5B,
and 3' NTR, preferably in the order mentioned, as well as a
marker gene for selection (selection gene).

In the present context the terms 5' NTR and NS3 and NS4A
and NS4B and NS5A and NS5B and 3' NTR comprise each
nucleotide sequence, which is described in the state of the 
art as the nucleotide sequence for each functional segment 
of the HCV genome. 

By providing a HCV-RNA construct such as this, a detailed
analysis of the HCV replication, pathogenesis and evolution
in cell culture is possible for the first time. The HCV
specific viral RNA can specifically be created as a
complete genome or subgenome in any amount, and it is
possible to manipulate the RNA construct and consequently
to examine and identify the HCV functions on a genetic
level.

Because all HCV enzymes identified as a main target for a 
therapy at the moment, namely the NS3/4A protease, the NS3 
helicase and the NS5B polymerase, are included in the HCV- 
RNA construct according to the invention, it can be used 
for all relevant analyses. 

An embodiment of the HCV-RNA construct, which has proven to
be worthwhile in practical use, stands out by the fact that
it comprises the nucleotide sequence according to the
sequence protocol SEQ ID NO: 1.

Further embodiments with similarly good properties for
practical use are characterized in that they comprise a
nucleotide sequence either according to sequence protocol
SEQ ID NO: 2 or according to sequence protocol SEQ ID NO: 3
or according to sequence protocol SEQ ID NO: 4 or according
to sequence protocol SEQ ID NO: 5 or according to sequence
protocol SEQ ID NO: 6 or according to sequence protocol SEQ
ID NO: 7 or according to sequence protocol SEQ ID NO: 8 or
according to sequence protocol SEQ ID NO: 9 or according to
sequence protocol SEQ ID NO: 10 or according to sequence
protocol SEQ ID NO: 11.

It is possible to provide the HCV subgenomic construct with
a 3' NTR, which has a nucleotide sequence so far unknown in 
the state of art, a nucleotide sequence, which has been 
selected from the group of nucleotide sequences (a) to (i) 
listed in the following: 

(a) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATC

TTTTTTTTTCTTTTTTTTTGAGAGAGAGAGTCTCACTCTGTTGCCCAGACTGGAG
T

(b) ACGGGGAGCT AAAC ACTCCAGGCC AATA TTTT TAGTCT

TTTTTTT TTC TTTTTTTTGA GAGAGAGAGT CTCACTCTGT TGCCCAGACT
GGAGC

(c) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCT TTTAATCTTT

TTTTTTTTCT TTTTTTTTGA GAGAGAGAGT CTCACTCTGT TGCCCAGACT
GGAGC

(d) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTTAGTC
TTTTTTTTTT TCTTTTTTTT TGAGAGAGAG AGTCTCACTC TGTTGCCCAG
ACTGGAGT

(e) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTAGTCT
TTTTTTTTTT TCTTTTTTTT TGAGAGAGAG AGTCTCACTC TGTTGCCCAG
ACTGGAGT

(f) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTAGTCT
TTTTTTTTTT TCTTTTTTTT TTGAGAGAGA GAGTCTCACT CTGTTGCCCA
GACTGGAGT

(g) ACGGGGAGCTAAACACTCCAGG TTTTTAGTCT
TTTTTTTTTT CTTTTTTTTT GAGAGAGAGA GTCTCACTCT GTTGCCCAGA
CTGGAGT

(h) ACGGGGAGCTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTT TTTTTTTAAT
CTTTTTTTTT TTTTTCCTTT TTTTGAGAGA GAGAGTCTCA CTCTGTTGCC
CAGACTGGAG T

(i) ACGGGGAGCTAAACACTCCAGGC TTTTTTAATC
TTTTTTTTTT TTTTCTTTTT TTTTTGAGAG AGAGAGTCTC ACTCTGTTGC
CCAGACTGGA GT

The marker gene for selection (selection gene) included in 
the HCV-RNA constructs according to the invention is
preferably a resistance gene, in particular an antibiotic 
resistance gene. 
This has the advantage that the cells transfected with this
construct can easily be selected from the non-transfected 
cells by adding for example the appropriate antibiotic to 
the cell culture medium in the case of an antibiotic 
resistance gene. 


In the present context 'antibiotic' means any substance
which prevents the non-transfected host cells, or the cells
in which the HCV-RNA is not replicating efficiently, from
continuing to live or grow, in particular the cell poisons
puromycin, hygromycin, Zeocin, bleomycin or blasticidin.

A preferred marker gene for selection (selection gene) and
resistance gene, which has proven to be worthwhile in
practice, is the neomycin phosphotransferase gene.

An alternative to the antibiotic resistance genes is, for
example, the thymidine kinase gene, which can be used to
carry out a HAT selection.

The marker gene for selection (selection gene), preferably
a resistance gene and most preferably an antibiotic
resistance gene, is preferably positioned in the HCV-RNA
construct after the HCV 5' NTR, which means downstream from
the 5' NTR and upstream from the HCV reading frame.
However, an insertion in the area of the 3' NTR or at
another site of the HCV genome or subgenome, for example
within the polyprotein, is also conceivable.
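The layout rules described so far can be illustrated with a small sketch. It assumes a construct is modelled as an ordered list of named elements; the class `Element`, the `kind` labels and both function names are hypothetical and are not part of the invention. The sketch only checks that the HCV segments named in the text and a selection gene are present, and whether the selection gene sits in the preferred position between the 5' NTR and the HCV reading frame.

```python
from dataclasses import dataclass

# HCV-specific RNA segments that the construct must contain according to the text.
REQUIRED_HCV_SEGMENTS = ("5' NTR", "NS3", "NS4A", "NS4B", "NS5A", "NS5B", "3' NTR")
# Segments belonging to the HCV reading frame (hypothetical helper constant).
HCV_ORF_SEGMENTS = ("NS3", "NS4A", "NS4B", "NS5A", "NS5B")


@dataclass(frozen=True)
class Element:
    name: str
    kind: str  # "hcv", "selection", "ires" or "reporter"; labels are illustrative


def check_construct(elements):
    """Return a list of problems; an empty list means all required parts are present."""
    names = [e.name for e in elements]
    problems = [f"missing {seg}" for seg in REQUIRED_HCV_SEGMENTS if seg not in names]
    if not any(e.kind == "selection" for e in elements):
        problems.append("missing selection gene")
    return problems


def selection_gene_in_preferred_position(elements):
    """True if the selection gene lies downstream of the 5' NTR and upstream of the HCV reading frame."""
    if check_construct(elements):
        return False
    names = [e.name for e in elements]
    selection_index = next(i for i, e in enumerate(elements) if e.kind == "selection")
    first_orf_index = min(names.index(seg) for seg in HCV_ORF_SEGMENTS)
    return names.index("5' NTR") < selection_index < first_orf_index


# Illustrative layout: 5' NTR, neo, heterologous IRES, NS3 to NS5B, 3' NTR.
replicon = [
    Element("5' NTR", "hcv"),
    Element("neo", "selection"),
    Element("EMCV IRES", "ires"),
    Element("NS3", "hcv"),
    Element("NS4A", "hcv"),
    Element("NS4B", "hcv"),
    Element("NS5A", "hcv"),
    Element("NS5B", "hcv"),
    Element("3' NTR", "hcv"),
]

print(check_construct(replicon))
print(selection_gene_in_preferred_position(replicon))
```

A construct with the selection gene inserted elsewhere, for example in the area of the 3' NTR, would still pass `check_construct` but would not satisfy the preferred-position check.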

In another embodiment of the HCV-RNA construct according to
the invention the marker gene for selection (selection
gene), in particular an antibiotic resistance gene, is
linked to the HCV-RNA or HCV genomic or subgenomic sequence
via a ribozyme or a recognition site for a ribozyme.

This has the advantage that, after the selection of the
cells in which the HCV-RNA is replicating productively,
the resistance gene in the obtained cell clones can be
separated from the HCV subgenomic sequence through a
ribozyme-dependent cleavage, namely by activating the
inserted ribozyme or, in the case of a construct with a
recognition site for a ribozyme, by transfecting the
ribozyme into the cells (for example through the
transfection of a ribozyme construct or infection with a
viral expression vector into which the appropriate
ribozyme has been inserted). By this means an authentic HCV
genomic construct without a resistance gene can be
obtained, which can then form authentic infectious virus
particles.

Another preferred embodiment of the HCV-RNA construct
according to the invention is characterized in that the
construct has at least one integrated reporter gene.

In the following, a reporter gene means any gene whose
presence can be easily detected, in general with simple
biochemical or histochemical methods, after being
transferred into a target organism, that is, a gene that
encodes a protein which can be easily and reliably detected
and quantified in small amounts with the common laboratory
measuring methods.

This variation of the HCV-RNA construct has the advantage 
that the extent of the replication of this construct can be 
easily and quickly measured with the methods common in the 
laboratory using the reporter gene product. 

The reporter gene is preferably a gene from the group of
the luciferase genes, the CAT gene (chloramphenicol acetyl
transferase gene), the lacZ gene (beta-galactosidase gene),
the GFP gene (green fluorescent protein gene), the GUS
gene (glucuronidase gene) or the SEAP gene (secreted
alkaline phosphatase gene). These reporter genes and their
products, namely the relevant reporter proteins, can be
detected for example using fluorescence, chemiluminescence,
colorimetric measurements or by means of immunological
methods (for example ELISA).

A surrogate marker gene can also be considered as a
reporter gene. In this context it includes those genes
which encode cellular proteins, nucleic acids or,
generally, those functions which are subject to
variation depending on the replication of the virus, and
which consequently are either suppressed or activated in
the cells in which the HCV or the HCV-RNA construct
multiplies. This means that the suppression or activation
of this function is a surrogate marker for the replication
of the virus or of the HCV-RNA construct.

The positions of the reporter gene and the marker gene for
selection (selection gene) can be selected in such a way
that a fusion protein made from both gene products will
be expressed. This has the advantage that these two genes
can be arranged in the HCV-RNA construct in such a way that
their two expressed proteins are at first fused via a
recognition sequence for a protease (for example ubiquitin)
or via a self-cleaving peptide (for example the 2A protein
of the picornaviruses) and will be separated
proteolytically later.

These two positions might as well lie apart from each other
in such a way that both gene products are expressed
separately (for example in the order: marker or resistance
gene - internal ribosome binding site - reporter gene).

In the case of the reporter gene one embodiment has proven
to be particularly worthwhile, in which the reporter gene
is cloned into the open reading frame of the HCV genome or
subgenome in such a way that it is converted into an active
form only after proteolytic processing.

The cell culture system according to the invention can be
used for various purposes in each of its embodiments. These
comprise:

• The detection of antiviral substances. These can include,
for example, organic compounds which interfere directly
or indirectly with viral growth (for example inhibitors
of the viral proteases, the NS3 helicase or the NS5B
RNA-dependent RNA polymerase), antisense oligonucleotides
which hybridize to any target sequence in the HCV-RNA
construct (for example the 5' NTR) and have a direct or
indirect influence on virus growth, for example due to a
reduction of translation of the HCV polyprotein, or
ribozymes which cleave any HCV-RNA sequence and
consequently impair virus replication.

• The evaluation of any type of antiviral substance in
cell culture. These substances may have been identified,
for example, on the isolated purified enzyme by means of
'rational drug design' or 'high-throughput screening'.
Evaluation means mainly the determination of the
inhibitory properties of the respective substance as well
as its mode of action.

• The identification of new targets of viral or cellular
origin for a HCV specific antiviral therapy. If, for
example, a cellular protein is essential for viral
replication, viral replication can also be influenced by
inhibiting this cellular protein. The system according to
the invention also enables the detection of these
auxiliary factors.

• The determination of drug resistance. It can be assumed
that resistance to therapy occurs due to the high
mutation rate of the HCV genome. This resistance, which
is very important for the clinical approval of a
substance, can be detected with the cell culture system
according to the invention. Cell lines in which the
HCV-RNA construct or the HCV genome or subgenome
replicates are incubated with increasing concentrations
of the relevant substance, and the replication of the
viral RNA is determined either by means of an introduced
reporter gene or through the qualitative or quantitative
detection of the viral nucleic acids or proteins.
Resistance is present if no inhibition, or only a reduced
inhibition, of replication is observed at the normal
concentration of the active substance. The nucleotide
and amino acid replacements responsible for the therapy
resistance can be determined by recloning the HCV-RNA
(for example by means of RT-PCR) and sequence
analysis. By cloning the relevant replacement(s) into the
original construct, their causality for the resistance to
therapy can be proven.

• The production of authentic virus proteins (antigens) 
for the development and/or evaluation of diagnostics. 
The cell culture system according to the invention also 
allows the expression of HCV antigens in cell cultures. 
In principle these antigens can be used as the basis for 
diagnostic detection methods. 

• The production of HCV viruses and virus-like particles,
in particular for the development or production of
therapeutics and vaccines as well as for diagnostic
purposes. In particular, cell culture adapted complete HCV
genomes, which can be produced by using the cell
culture system according to the invention, are able to
replicate in cell culture with high efficiency. These
genomes have the complete functions of HCV and
consequently are able to produce infectious viruses.
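The determination of drug resistance listed among the uses above compares replication with and without the active substance. A minimal sketch of that comparison follows, assuming replication is read out as a reporter signal. All function names, the signal values and the threshold `min_drop` are illustrative assumptions and are not taken from the description.

```python
def percent_inhibition(signal_treated, signal_untreated):
    """Inhibition of replication in percent, relative to untreated cells."""
    if signal_untreated <= 0:
        raise ValueError("untreated signal must be positive")
    return 100.0 * (signal_untreated - signal_treated) / signal_untreated


def shows_resistance(parental_inhibition, candidate_inhibition, min_drop=50.0):
    """Flag resistance when inhibition at the normal concentration drops markedly.

    min_drop (in percentage points) is an arbitrary illustrative threshold.
    """
    return (parental_inhibition - candidate_inhibition) >= min_drop


# Hypothetical reporter readings at the normal concentration of the substance.
parental = percent_inhibition(signal_treated=50.0, signal_untreated=1000.0)
candidate = percent_inhibition(signal_treated=800.0, signal_untreated=1000.0)

print(round(parental, 1), round(candidate, 1))
print(shows_resistance(parental, candidate))
```

In practice the readout would be taken over a series of increasing concentrations, as the text describes; the sketch evaluates only a single concentration to keep the comparison explicit.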

The HCV-RNA construct according to the invention can by
itself also be used for various purposes in all its
embodiments. These include first of all:

• The construction of attenuated hepatitis C viruses or
HCV-like particles and their production in cell
cultures:

Attenuated HCV or HCV-like particles, which means viruses
or virus-like particles with complete ability to replicate
but reduced or missing pathogenicity, can be created by
accidental or purposefully introduced mutations, such as
point mutations, deletions or insertions. These
attenuated HCV or HCV-like particles can be used in
particular as a vaccine.

• The construction of HCV-RNA constructs with integrated
foreign genes, used for example as liver cell specific
vectors in gene therapy. Due to the distinctive liver
cell tropism of HCV and the possibility of replacing
parts of the genome by heterologous sequences, HCV-RNA
constructs can be produced in which, for example, the
structural proteins are replaced by a therapeutically
effective gene. The HCV-RNA construct obtained in this
way is introduced, preferably by means of transfection,
into cells which express the missing HCV functions,
for example the structural proteins, in a constitutive
or inducible way. Virus particles carrying the HCV-RNA
construct can be created by means of this method, known
to the expert under the term 'transcomplementation'. The
particles obtained can preferably be used for the
infection of liver cells. Within these the
therapeutically effective foreign gene will be expressed
and will consequently develop its therapeutic effect.

• The detection of permissive cells, which means cells in
which productive virus growth occurs. For this purpose
either one of the previously mentioned HCV-RNA genomic
constructs, which is able to form complete infectious
viruses, or one of the previously mentioned HCV
subgenomes is used; the subgenome, however, is first
transfected, according to the previously mentioned
example, into a cell line which expresses the missing
functions in a constitutive or inducible way. In each
case virus particles are created which carry a resistance
and/or reporter gene in addition to the HCV sequence. In
order to detect cells in which HCV is able to replicate,
these cells are infected with the viruses generated in
this way and subjected to an antibiotic selection or,
depending on the HCV-RNA construct, examined for
expression of the reporter gene. Because antibiotic
resistance or reporter gene expression can only be
established when the HCV-RNA construct replicates, the
cells detected in this way must be permissive. Almost any
cell line or primary cell culture can be tested for
permissivity and detected in this way.

The cell culture system according to the invention also
permits targeted discovery of HCV-RNA constructs whose
efficiency of replication is increased due to mutations.
These mutations arise either by chance, in the course of
HCV-RNA replication, or by targeted introduction into
the construct. Such mutations, which lead to a change in
the replication of the HCV-RNA construct, are known to
experts as adaptive mutations. The invention therefore also
includes a method for obtaining cell culture adapted
mutants of a HCV-RNA construct according to the invention
following the above description, in which the mutants have
increased replication efficiency compared to the original
HCV-RNA construct. It further includes a method for the
production of mutants of a HCV-RNA full-length genome or of
a HCV-RNA subgenome or of any HCV-RNA construct with
increased replication efficiency compared to the original
HCV-RNA full-length genome or subgenome or HCV-RNA
construct, as well as cell culture adapted mutants of
HCV-RNA constructs, HCV-RNA full-length genomes and HCV
subgenomes with increased replication efficiency compared
to the original constructs, subgenomes or full-length
genomes.


The method according to the invention for the production of
cell culture adapted mutants of a HCV-RNA construct
according to the invention, in which the mutants have
increased replication efficiency compared to the HCV-RNA
construct, is characterised in that a cell culture system
according to claim 1, in which the transfected HCV specific
genetic material is a HCV-RNA construct with a selection
gene according to claims 4 to 19, is cultivated on/in the
selection medium corresponding to the selection gene, that
the cultivated cell clones are collected and that the
HCV-RNA construct is isolated from these cell clones.

In an advantageous extension of this production method, the
isolated HCV-RNA constructs are passaged at least one more
time, that is, they are transfected into cells of a cell
culture system according to claim 1, the cell culture
system according to claim 1 thus obtained, in which the
transfected HCV specific genetic material is the isolated
HCV-RNA construct with a selection gene, is cultivated
on/in the selection medium corresponding to the selection
gene, the cultivated cell clones are collected and the
HCV-RNA constructs are isolated from them.

Using this process variation, the number of adaptive
mutations and hence the degree of replication efficiency in
the relevant HCV-RNA constructs can be increased even
further.

The method according to the invention for the production of
mutants of a HCV-RNA full-length genome or of a HCV-RNA
subgenome or of any HCV-RNA construct with increased
replication efficiency compared to the original HCV-RNA
full-length genome or subgenome or HCV-RNA construct has
the following features. Using one of the two production
methods presented above, a cell culture adapted mutant of a
HCV-RNA construct is produced, isolated from the cells,
cloned using methods known in the state of the art and
sequenced. By comparison with the nucleotide and amino acid
sequence of the original HCV-RNA construct, the type, number
and position of the mutations are determined. These
mutations are then introduced into an (isolated) HCV
subgenome or full-length genome or any HCV-RNA construct,
either by site-directed mutagenesis or by exchange of DNA
fragments containing the relevant mutations.
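The sequence comparison step can be sketched as follows. This is a minimal illustration that assumes the original and the adapted sequence are already aligned and of equal length, so that only substitutions are reported; insertions and deletions would require a proper alignment first. The function name and the toy sequences are hypothetical.

```python
def list_substitutions(original, adapted, first_position=1):
    """Return (position, original_residue, adapted_residue) for every mismatch.

    Works for nucleotide or amino acid strings of equal length.
    """
    if len(original) != len(adapted):
        raise ValueError("sequences must be aligned and of equal length")
    return [
        (first_position + i, a, b)
        for i, (a, b) in enumerate(zip(original, adapted))
        if a != b
    ]


# Toy sequences, not taken from any HCV isolate.
original_seq = "ACGTACGT"
adapted_seq = "ACGAACGT"

for position, old, new in list_substitutions(original_seq, adapted_seq):
    print(f"{position} {old} -> {new}")
```

The `first_position` parameter allows the reported positions to follow the numbering of a larger reference, for example the polyprotein, when only a fragment is compared.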

A test can be carried out to determine or verify which
mutations are actually responsible for an alteration of
replication efficiency, particularly an increase in
replication. In this test the corresponding nucleotide
and/or amino acid changes are introduced into the original
HCV-RNA construct and the modified construct is then
transfected into cells in culture. If the introduced
mutation actually leads to an increase in replication, then
for a HCV-RNA construct with a selectable marker gene the
number of resistant cell clones obtained with the
artificially mutated construct should be noticeably higher
than with the untreated construct.

In the case of a construct with a reporter gene, the
activity or quantity of the reporter should be noticeably
higher for the artificially mutated construct compared to
the untreated one.
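The comparison of resistant cell clone numbers can be expressed as a simple ratio. The following sketch assumes that colony counts are normalised to the amount of transfected RNA; the function names and all numbers are hypothetical and serve only to show the arithmetic.

```python
def colonies_per_microgram(colony_count, rna_micrograms):
    """Resistant cell clones obtained per microgram of transfected RNA."""
    if rna_micrograms <= 0:
        raise ValueError("RNA amount must be positive")
    return colony_count / rna_micrograms


def fold_change(mutated_efficiency, original_efficiency):
    """Ratio of the mutated construct's efficiency to that of the original construct."""
    if original_efficiency <= 0:
        raise ValueError("original efficiency must be positive to form a ratio")
    return mutated_efficiency / original_efficiency


# Hypothetical counts for the original and the artificially mutated construct.
original = colonies_per_microgram(colony_count=30, rna_micrograms=1.0)
mutated = colonies_per_microgram(colony_count=3000, rna_micrograms=1.0)

print(fold_change(mutated, original))
```

The same ratio can be formed from reporter activities instead of colony counts when the construct carries a reporter gene.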

The cell culture adapted HCV-RNA constructs with high
replication efficiency according to the invention are
characterized in that, through nucleotide or amino acid
exchanges, they are derivable from a HCV-RNA construct
according to one of the claims 4 to 19, and that they are
obtainable using one of the two production processes
presented above.

These cell culture adapted HCV-RNA constructs can be used
to produce any HCV-RNA constructs or HCV full-length genomes
or subgenomes with increased replication efficiency. Both
constructs with a selectable resistance gene and constructs
without one or with a non-selectable reporter gene (e.g.
luciferase) can be produced in this way, since replication
of cell culture adapted HCV-RNA constructs can also be
demonstrated in non-selected cells due to their high
replication efficiency.

The cell culture adapted mutants of a HCV-RNA construct or 
HCV-RNA full-length genome or HCV subgenome with high
replication efficiency compared to the original HCV-RNA 
construct or the original HCV full-length genome are 
characterized in that they are obtainable by a method in 
which the type and number of mutations in a cell culture 
adapted HCV-RNA construct are determined through sequence 
analysis and sequence comparison and these mutations are 
introduced into a HCV-RNA construct, particularly a HCV-RNA 
construct according to one of the claims 4 to 19, or into 
an (isolated) HCV-RNA full-length genome, either by site- 
directed mutagenesis, or by exchange of DNA fragments 
containing the relevant mutations. 

A group of preferred HCV-RNA constructs, HCV full-length
genomes and HCV subgenomes with high and very high
replication efficiency, which are consequently highly
suitable for practical use, is characterised in that it
contains one, several or all of the amino acid or nucleic
acid exchanges listed in Table 3 and/or one or several of
the following amino acid exchanges: 1283 arg -> gly, 1383
glu -> ala, 1577 lys -> arg, 1609 lys -> glu, 1936
pro -> ser, 2163 glu -> gly, 2330 lys -> glu, 2442 ile
-> val. (The numbers refer to the amino acid positions of
the polyprotein of the HCV isolate con1, see Table 1).
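The amino acid positions above can be related to the individual HCV proteins by using the nucleotide coordinates given for SEQ ID NO: 1 in the feature summary of the sequence listings. The sketch below assumes that amino acid 1 of the polyprotein corresponds to the first Core codon at nucleotide 1813 of SEQ ID NO: 1; this assumption, the function name and the data layout are illustrative and not stated in the description.

```python
# Coding regions of SEQ ID NO: 1 (nucleotide positions, inclusive).
POLYPROTEIN_START = 1813
PROTEINS = [
    ("Core", 1813, 2385),
    ("E1", 2386, 2961),
    ("E2", 2962, 4050),
    ("p7", 4051, 4239),
    ("NS2", 4240, 4890),
    ("NS3", 4891, 6783),
    ("NS4A", 6784, 6945),
    ("NS4B", 6946, 7728),
    ("NS5A", 7729, 9069),
    ("NS5B", 9070, 10842),
]

# Amino acid exchanges named in the text: (position, original, replacement).
EXCHANGES = [
    (1283, "arg", "gly"),
    (1383, "glu", "ala"),
    (1577, "lys", "arg"),
    (1609, "lys", "glu"),
    (1936, "pro", "ser"),
    (2163, "glu", "gly"),
    (2330, "lys", "glu"),
    (2442, "ile", "val"),
]


def protein_for_residue(aa_position):
    """Return the protein whose coding region contains the given polyprotein residue."""
    # First nucleotide of the codon, assuming residue 1 starts at POLYPROTEIN_START.
    codon_start = POLYPROTEIN_START + 3 * (aa_position - 1)
    for name, start, end in PROTEINS:
        if start <= codon_start <= end:
            return name
    raise ValueError(f"residue {aa_position} lies outside the polyprotein")


for position, old, new in EXCHANGES:
    print(f"{position} {old} -> {new}: {protein_for_residue(position)}")
```

Under the stated assumption, the listed exchanges fall into the nonstructural proteins NS3, NS4B, NS5A and NS5B.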

Special features of the nucleotide sequences according to the sequence listings:

SEQ ID-NO: 1

Name: I389/Core-3'/wt
Composition (Nucleotide positions):
1. 1-341: HCV 5' non-translated region
2. 342-1193: HCV Core Protein-Neomycin Phosphotransferase
   fusion protein; selectable Marker
3. 1202-1812: internal ribosome entry site from
   encephalomyocarditis virus; directs translation of the
   downstream located HCV open reading frame
4. 1813-10842: HCV Polyprotein from Core up to nonstructural
   protein 5B
5. 1813-2385: HCV Core Protein; structural protein
6. 2386-2961: envelope protein 1 (E1); structural protein
7. 2962-4050: envelope protein 2 (E2); structural protein
8. 4051-4239: Protein p7
9. 4240-4890: nonstructural protein 2 (NS2); HCV NS2-3
   Protease
10. 4891-6783: nonstructural protein 3 (NS3); HCV NS3
    Protease/Helicase
11. 6784-6945: nonstructural protein 4A (NS4A); NS3
    Protease cofactor
12. 6946-7728: nonstructural protein 4B (NS4B)
13. 7729-9069: nonstructural protein 5A (NS5A)
14. 9070-10842: nonstructural protein 5B (NS5B);
    RNA-dependent RNA-polymerase
15. 10846-11076: HCV 3' non-translated region

SEQ ID-NO: 2

Name: I377/NS2-3'/wt
Composition (Nucleotide positions):
1. 1-341: HCV 5' non-translated region
2. 342-1181: HCV Core Protein-Neomycin Phosphotransferase
   fusion protein; selectable Marker
3. 1190-1800: internal ribosome entry site from
   encephalomyocarditis virus; directs translation of the
   downstream located HCV open reading frame
4. 1801-8403: HCV Polyprotein from nonstructural protein 2
   up to nonstructural protein 5B
5. 1801-2451: nonstructural protein 2 (NS2); HCV NS2-3
   Protease
6. 2452-4344: nonstructural protein 3 (NS3); HCV NS3
   Protease/Helicase
7. 4345-4506: nonstructural protein 4A (NS4A); NS3
   Protease cofactor
8. 4507-5289: nonstructural protein 4B (NS4B)
9. 5290-6630: nonstructural protein 5A (NS5A)
10. 6631-8403: nonstructural protein 5B (NS5B);
    RNA-dependent RNA-polymerase
11. 8407-8637: HCV 3' non-translated region

SEQ ID-NO: 3

Name: I389/NS3-3'/wt
Composition (Nucleotide positions):
1. 1-341: HCV 5' non-translated region
2. 342-1193: HCV Core Protein-Neomycin Phosphotransferase
   fusion protein; selectable Marker
3. 1202-1812: internal ribosome entry site from
   encephalomyocarditis virus; directs translation of the
   downstream located HCV open reading frame
4. 1813-7767: HCV Polyprotein from nonstructural protein 3
   up to nonstructural protein 5B
5. 1813-3708: nonstructural protein 3 (NS3); HCV NS3
   Protease/Helicase
6. 3709-3870: Nonstructural protein 4A (NS4A); NS3
   Protease Cofactor
7. 3871-4653: Nonstructural protein 4B (NS4B)
8. 4654-5994: Nonstructural protein 5A (NS5A)
9. 5995-7767: Nonstructural protein 5B (NS5B);
   RNA-dependent RNA-Polymerase
10. 7771-8001: HCV 3' non-translated Region

SEQ ID-NO: 4

Name: I377/NS3-3'/wt
Composition (Nucleotide positions):
1. 1-341: HCV 5' non-translated region
2. 342-1181: HCV Core Protein-Neomycin Phosphotransferase
   fusion protein; selectable Marker
3. 1190-1800: internal ribosome entry site from
   encephalomyocarditis virus; directs translation of the
   downstream located HCV open reading frame
4. 1801-7758: HCV Polyprotein from Nonstructural protein 3
   up to Nonstructural protein 5B
5. 1801-3696: Nonstructural protein 3 (NS3); HCV NS3
   Protease/Helicase
6. 3697-3858: Nonstructural protein 4A (NS4A); NS3
   Protease Cofactor
7. 3859-4641: Nonstructural protein 4B (NS4B)
8. 4642-5982: Nonstructural protein 5A (NS5A)
9. 5983-7755: Nonstructural protein 5B (NS5B);
   RNA-dependent RNA-Polymerase
10. 7759-7989: HCV 3' non-translated Region
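The coordinates listed for SEQ ID NO: 3 and SEQ ID NO: 4 can be cross-checked with a few lines of arithmetic. The sketch below is only an illustrative consistency check with hypothetical helper names: it confirms that each coding region has a length divisible by three and that the two constructs differ by a constant offset, which matches the 12-nucleotide difference between positions 377 and 389 of the HCV-IRES.

```python
# Coding regions (nucleotide positions, inclusive) as listed for the two constructs.
SEQ_ID_3 = {
    "NS3": (1813, 3708),
    "NS4A": (3709, 3870),
    "NS4B": (3871, 4653),
    "NS5A": (4654, 5994),
    "NS5B": (5995, 7767),
}
SEQ_ID_4 = {
    "NS3": (1801, 3696),
    "NS4A": (3697, 3858),
    "NS4B": (3859, 4641),
    "NS5A": (4642, 5982),
    "NS5B": (5983, 7755),
}


def length(span):
    """Number of nucleotides in an inclusive (start, end) span."""
    start, end = span
    return end - start + 1


def codons(span):
    """Number of codons in a span; raises if the span is not a multiple of three."""
    nucleotides = length(span)
    if nucleotides % 3 != 0:
        raise ValueError("span length is not a multiple of three")
    return nucleotides // 3


# Offset of every region start between the I389 and the I377 construct.
offsets = {name: SEQ_ID_3[name][0] - SEQ_ID_4[name][0] for name in SEQ_ID_3}
codon_counts = {name: codons(span) for name, span in SEQ_ID_3.items()}

print(offsets)
print(codon_counts)
```

A check of this kind is a convenient way to spot transcription errors in coordinate tables before they are used to locate mutations.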

SEQ ID-NO: 5

Name: I389/NS2-3'/wt
Composition (Nucleotide positions):
1. 1-341: HCV 5' non-translated region
2. 342-1193: HCV Core Protein-Neomycin Phosphotransferase
   fusion protein; selectable Marker
3. 1202-1812: internal ribosome entry site from
   encephalomyocarditis virus; directs translation of the
   downstream located HCV open reading frame
4. 1813-8418: HCV Polyprotein from Nonstructural protein 2
   up to Nonstructural protein 5B
5. 1813-2463: Nonstructural protein 2 (NS2); HCV NS2-3
   Protease
6. 2464-4356: Nonstructural protein 3 (NS3); HCV NS3
   Protease/Helicase
7. 4357-4518: Nonstructural protein 4A (NS4A); NS3
   Protease Cofactor
8. 4519-5301: Nonstructural protein 4B (NS4B)
9. 5302-6642: Nonstructural protein 5A (NS5A)
10. 6643-8415: Nonstructural protein 5B (NS5B);
    RNA-dependent RNA-Polymerase
11. 8419-8649: HCV 3' non-translated Region

SEQ ID-NO: 6

Name: I389/NS3-3'/9-13F
Composition (Nucleotide positions):
1. 1-341: HCV 5' non-translated region
2. 342-1193: HCV Core Protein-Neomycin Phosphotransferase
   fusion protein; selectable Marker
3. 1202-1812: internal ribosome entry site from
   encephalomyocarditis virus; directs translation of the
   downstream located HCV open reading frame
4. 1813-7767: HCV Polyprotein from Nonstructural protein 3
   up to Nonstructural protein 5B of the cell culture-
   adapted mutant 9-13F
5. 1813-3708: Nonstructural protein 3 (NS3); HCV NS3
   Protease/Helicase
6. 3709-3870: Nonstructural protein 4A (NS4A); NS3
   Protease Cofactor
7. 3871-4653: Nonstructural protein 4B (NS4B)
8. 4654-5994: Nonstructural protein 5A (NS5A)
9. 5995-7767: Nonstructural protein 5B (NS5B);
   RNA-dependent RNA-Polymerase
10. 7771-8001: HCV 3' non-translated Region

SEQ ID-NO: 7

Name: I389/Core-3'/9-13F
Composition (Nucleotide positions):
1. 1-341: HCV 5' non-translated region
2. 342-1193: HCV Core Protein-Neomycin Phosphotransferase
   fusion protein; selectable Marker
3. 1202-1812: internal ribosome entry site from
   encephalomyocarditis virus; directs translation of the
   downstream located HCV open reading frame
4. 1813-10842: HCV Polyprotein from Core up to Nonstructural
   protein 5B of the cell culture-adapted mutant 9-13F
5. 1813-2385: HCV Core Protein; structural protein
6. 2386-2961: envelope protein 1 (E1); structural protein
7. 2962-4050: envelope protein 2 (E2); structural protein
8. 4051-4239: Protein p7
9. 4240-4890: Nonstructural protein 2 (NS2); HCV NS2-3
   Protease
10. 4891-6783: Nonstructural protein 3 (NS3); HCV NS3
    Protease/Helicase
11. 6784-6945: Nonstructural protein 4A (NS4A); NS3
    Protease Cofactor
12. 6946-7728: Nonstructural protein 4B (NS4B)
13. 7729-9069: Nonstructural protein 5A (NS5A)
14. 9070-10842: Nonstructural protein 5B (NS5B);
    RNA-dependent RNA-Polymerase
15. 10846-11076: HCV 3' non-translated Region

SEQ ID-NO: 8

Name: I389/NS3-3'/5.1
Composition (Nucleotide positions):
1. 1-341: HCV 5' non-translated region
2. 342-1193: HCV Core Protein-Neomycin Phosphotransferase
   fusion protein; selectable Marker
3. 1202-1812: internal ribosome entry site from
   encephalomyocarditis virus; directs translation of the
   downstream located HCV open reading frame
4. 1813-7767: HCV Polyprotein from Nonstructural protein 3
   up to Nonstructural protein 5B of the cell culture-
   adapted mutant 5.1
5. 1813-3708: Nonstructural protein 3 (NS3); HCV NS3
   Protease/Helicase
6. 3709-3870: Nonstructural protein 4A (NS4A); NS3
   Protease Cofactor
7. 3871-4653: Nonstructural protein 4B (NS4B)
8. 4654-5994: Nonstructural protein 5A (NS5A)
9. 5995-7767: Nonstructural protein 5B (NS5B);
   RNA-dependent RNA-Polymerase
10. 7771-8001: HCV 3' non-translated Region

SEQ ID-NO: 9

Name: I389/Core-3'/5.1
Composition (Nucleotide positions):
1. 1-341: HCV 5' non-translated region
2. 342-1193: HCV Core Protein-Neomycin Phosphotransferase
   fusion protein; selectable Marker
3. 1202-1812: internal ribosome entry site from
   encephalomyocarditis virus; directs translation of the
   downstream located HCV open reading frame
4. 1813-10842: HCV Polyprotein from Core up to Nonstructural
   protein 5B of the cell culture-adapted mutant 5.1
5. 1813-2385: HCV Core Protein; structural protein
6. 2386-2961: envelope protein 1 (E1); structural protein
7. 2962-4050: envelope protein 2 (E2); structural protein
8. 4051-4239: Protein p7
9. 4240-4890: Nonstructural protein 2 (NS2); HCV NS2-3
   Protease
10. 4891-6783: Nonstructural protein 3 (NS3); HCV NS3
    Protease/Helicase
11. 6784-6945: Nonstructural protein 4A (NS4A); NS3
    Protease Cofactor
12. 6946-7728: Nonstructural protein 4B (NS4B)
13. 7729-9069: Nonstructural protein 5A (NS5A)
14. 9070-10842: Nonstructural protein 5B (NS5B);
    RNA-dependent RNA-Polymerase
15. 10846-11076: HCV 3' non-translated Region

SEQ ID-NO: 10

Name: I389/NS3-3'/19
Composition (Nucleotide positions):
1. 1-341: HCV 5' non-translated region
2. 342-1193: HCV Core Protein-Neomycin Phosphotransferase
   fusion protein; selectable Marker
3. 1202-1812: internal ribosome entry site from
   encephalomyocarditis virus; directs translation of the
   downstream located HCV open reading frame
4. 1813-7767: HCV Polyprotein from Nonstructural protein 3
   up to Nonstructural protein 5B of the cell culture-
   adapted mutant 19
5. 1813-3708: Nonstructural protein 3 (NS3); HCV NS3
   Protease/Helicase
6. 3709-3870: Nonstructural protein 4A (NS4A); NS3
   Protease Cofactor
7. 3871-4653: Nonstructural protein 4B (NS4B)
8. 4654-5994: Nonstructural protein 5A (NS5A)
9. 5995-7767: Nonstructural protein 5B (NS5B);
   RNA-dependent RNA-Polymerase
10. 7771-8001: HCV 3' non-translated Region

SEQ ID-NO: 11

Name: I389/Core-3'/19
Composition (Nucleotide positions):
1. 1-341: HCV 5' non-translated region
2. 342-1193: HCV Core Protein-Neomycin Phosphotransferase
   fusion protein; selectable Marker
3. 1202-1812: internal ribosome entry site from
   encephalomyocarditis virus; directs translation of the
   downstream located HCV open reading frame
4. 1813-10842: HCV Polyprotein from Core up to Nonstructural
   protein 5B of the cell culture-adapted mutant 19
5. 1813-2385: HCV Core Protein; structural protein
6. 2386-2961: envelope protein 1 (E1); structural protein
7. 2962-4050: envelope protein 2 (E2); structural protein
8. 4051-4239: Protein p7
9. 4240-4890: Nonstructural protein 2 (NS2); HCV NS2-3
   Protease
10. 4891-6783: Nonstructural protein 3 (NS3); HCV NS3
    Protease/Helicase
11. 6784-6945: Nonstructural protein 4A (NS4A); NS3
    Protease Cofactor
12. 6946-7728: Nonstructural protein 4B (NS4B)
13. 7729-9069: Nonstructural protein 5A (NS5A)
14. 9070-10842: Nonstructural protein 5B (NS5B);
    RNA-dependent RNA-Polymerase
15. 10846-11076: HCV 3' non-translated Region

The invention is described in detail in the following by
way of examples of embodiments and respective diagrams.
These are as follows.

Fig. 1 A: The structure of a HCV-RNA construct according to
the invention

On the top, a diagram of the structure of the
complete parental HCV genome is given with the
positions of the genes for the cleavage products
core, E1, E2, p7, NS2, NS3, NS4A, NS4B, NS5A and
NS5B within the polyprotein, and the 5' and 3'
non-translated regions (5' NTR and 3' NTR),
shown as a thick horizontal line, and with the
two positions selected for the creation of the
subgenomic constructs, namely the position of the
'GDD catalytic domain' of the NS5B RNA polymerase
(GDD) and the position of the 3' end of the
HCV-IRES (nucleotide positions 1 to 377 and 1 to 389),
drawn above the diagram of the genome. The
numbers below the diagram of the genome indicate
the respective nucleotide positions.

Diagrams of the structure of two modified HCV-RNA
constructs (subgenomes) according to the invention
are shown below, consisting of the 5' HCV-IRES,
the neomycin phosphotransferase gene (NeoR), the
EMCV-IRES (E-I) and the HCV sequences from NS2 or
NS3 up to the authentic 3' end. The position of
the 10-amino acid deletion comprising the NS5B
polymerase GDD motif is marked with a triangle
(Δ).

Fig. 1 B: The result of a denaturing formaldehyde-agarose
gel electrophoresis for the detection of
replicating plus-strand RNA in transfected
subpassaged Huh 7 cell clones.

The positions of HCV specific RNAs (arrows) and
the 28S rRNA are specified to the right of lane
12, the size (number of nucleotides) of the RNA
marker (M) is specified to the left of lane 1.

Fig. 1 C: The result of a PCR test with subsequent Southern
Blot to demonstrate the absence of integrated
replicon DNA in most of the selected cell clones.

The lanes 1 and 2 show the positive control, lane
13 the negative control. The figures to the left
of lane 1 indicate the size of the nucleotide
marker molecules.

Fig. 2 A: The result of a PCR test with subsequent Southern
Blot for the sensitive exclusion of integrated
replicon DNA (plasmid molecule I377/NS3-3'/wt) in
a cell clone containing a HCV-RNA construct (9-13).

The lanes 7 to 11 represent the result of a
titration of DNA molecules of the construct
I377/NS3-3'/wt without addition of total DNA of
the cell clone 9-13, and the lanes 2 to 6
represent the result obtained with the same
plasmid molecules with the addition of 1 µg 9-13
DNA each prior to the PCR (for the purpose of
excluding an inhibitor of the PCR in the DNA
preparation). Lane 13 represents the negative
control (PCR without DNA template). Lane 1 shows
the result which was achieved with one µg total
DNA of the cell clone 9-13.

Fig. 2 B: The result of a Northern Blot test to quantify
the amounts of HCV plus- and minus-strand RNA.

The arrows mark the positions of replicon RNA.

The "plus" and "minus" details indicate the
positive (plus) and negative (minus) polarity of
the RNA controls, which have been applied to the
gel. "Minus-strand" and "Plus-strand" indicate
the specificity of the radioactive RNA probes.

Fig. 2 C: The result of a formaldehyde-agarose gel
electrophoresis after radioactive labeling of the
intracellular replicated HCV-RNA to demonstrate
the resistance of HCV-RNA replication to
dactinomycin.

Fig. 3 A: The detection of HCV specific antigens in the
selected cell clones by means of
immunoprecipitation after metabolic radioactive
labeling.

The lanes 7-9 represent authentic size markers
(which have been obtained in Huh 7 cells after
the transient expression of a HCV-RNA construct);
identified HCV proteins are labeled on the left
edge of lane 1, the molecular weights (in
kilodaltons) are specified to the right of lane 9.

Fig. 3 B: Results of an immunofluorescence test to
establish the subcellular localisation of HCV
antigens.

Fig. 4: Diagram of the structure of a selectable HCV-RNA
construct according to the invention (complete
genome) consisting of the 5' HCV-IRES, the
neomycin phosphotransferase gene (NeoR), a
heterologous IRES element, for example from the
encephalomyocarditis virus (E-I), the complete
HCV open reading frame and the authentic 3' NTR.

Fig. 5: Diagram of the structure of HCV-RNA constructs
with inserted antibiotic resistance gene (A)
within the nucleotide sequence encoding the
polyprotein (monocistronic RNA construct), and
(B) within the 3' NTR (bicistronic RNA
construct).

Fig. 6: Diagram of the structure of HCV-RNA constructs
with inserted reporter genes (A) as part of a HCV
replicon from NS3 to NS5B - in the end the
reporter protein is cleaved out of the
polyprotein by viral or cellular proteases, or
the marker gene for selection (selection gene) or
the resistance gene is transfected into the cells
through co-transfection -, (B) as part of a
fusion gene composed of a resistance and reporter
gene (for example for the neomycin
phosphotransferase and green fluorescent
protein), (C) as part of a replicon composed of a
resistance and reporter gene (for example the
neomycin phosphotransferase and green fluorescent
protein) bound via a nucleotide sequence, which
codes for an amino acid sequence (hatched area),
which can be cleaved by a protease or has a
self-cleaving (autocatalytic) activity, (D) as
independent gene (in this case the green
fluorescent protein), which is expressed from its
own internal ribosome binding site (IRES) - the
resistance gene (in this case the neomycin
phosphotransferase gene) is also expressed from
its own internal ribosome binding site (IRES)
(polycistronic construct).

Fig. 7: Diagram of the structure of a HCV-RNA construct,
where the resistance gene is linked to the
HCV-RNA sequence via a ribozyme or a recognition
site for a ribozyme.

The thick lines illustrate the HCV 5' and 3'
NTRs, E-I is a heterologous internal ribosome
binding site, which is required for the
expression of the resistance gene, and the grey
square illustrates the ribozyme or a recognition
site for a ribozyme.

Fig. 8: Diagram of the structure of a HCV-RNA construct
with resistance gene and integrated foreign gene.

Fig. 9: Method for comparing the specific infectivity
(expressed as number of cell colonies obtained)
of total RNA against in vitro transcripts.
HCV-RNA is generated by in vitro transcription of
a corresponding RNA construct and quantified by
measurement of the optical density at 260 nm (OD
260 nm). A defined number of these molecules is
mixed with a specified amount of total RNA from
naive Huh-7 cells and this mixture is transfected
into naive Huh-7 cells with the aid of
electroporation. At the same time the total RNA
of a cell clone, produced by the method described
in Figure 1, is isolated using a known state of
the art method and the amount of HCV-RNA
contained therein is determined by means of
Northern Blot using a HCV specific RNA probe and
subsequent quantification via phosphoimaging. A
defined amount of this total RNA is analogously
transfected into naive Huh-7 cells. The cells
from both cultures are then subjected to a
G418 selection and the number of colonies formed
is determined by counting after fixing and
staining with Coomassie Brilliant Blue. For the
determination of transfection efficiency 1 µg of a
plasmid allowing the expression of luciferase is
added to each transfection culture. An aliquot of
the transfected cells is collected after 24 hours
and the luciferase activity determined in the
respective cell lysates. The number of colonies
is always normalised to the luciferase expression.

Fig. 10: Sequence analysis of the 9-13 clone. Total RNA of
the 9-13 cell clone, resulting from transfection
of HCV-RNA construct I377/NS3-3', was isolated
using a known state of the art method and the
HCV-RNA construct amplified from nucleotide
positions 59 to 9386 with the aid of
'long-distance RT-PCR' using primers S59 and
A9413. The PCR fragments were cloned and 11
clones (9-13 A - K) completely sequenced. Clones
D and I, E and G as well as H and J turned out to
be identical, respectively. The positions of the
amino acid differences in the NS3-5B region
between the recloned HCV-RNAs and the parental
construct are marked with a thick vertical mark
for each clone. Each clone was digested with
restriction enzyme SfiI and the respective
fragment inserted in the parental construct.
These clones were each transfected in Huh-7 cells
and the cells subjected to selection as described
in Figure 1. The number of cell clones obtained
with each construct is noted next to the
respective construct on the right.


Fig. 11 A: Principle of determination of replication with
the aid of a reporter gene. In the upper part of
the figure, the HCV-DNA construct I389/Luc/NS3-3'
is shown. It consists of the HCV 5' NTR
(nucleotide positions 1-389), the luciferase gene
(Luc), the IRES of the encephalomyocarditis
virus, the HCV NS3-5B and the 3' NTR. The
position of the active centre of the NS5B RNA
polymerase, into which a deactivating amino acid
exchange was introduced, is indicated by 'GND'.
The plasmids, which code for the HCV-RNA
construct which is able to replicate or is
defective, are digested with the restriction
enzyme ScaI and added to an in vitro
transcription reaction with T7 RNA polymerase.
After removal of the template DNA, the respective
HCV-RNA constructs were transfected in naive
Huh-7 cells by means of electroporation and the
latter collected at regular intervals.

Fig. 11 B: Comparison of luciferase activity in cells
transfected with the parental HCV-RNA construct
I389/Luc/NS3-3'/wt (wt) or the following variants:
inactive RNA (318 DN), variant 9-13F or variant
5.1. The cells were collected 6 (not shown), 24,
48, 72, 96, 120, 144 and 168 hours after
transfection and luciferase activities determined
by luminometric measurement.

Fig. 12: Selectable HCV full-length genomes (constructs
I389/core-3'/5.1 and I389/core-3'/9-13F).
(A) Diagram of the full-length construct. The
region between both indicated recognition sites
for the restriction enzyme SfiI corresponds to
the sequences of the highly adapted RNA variants
5.1 or 9-13F.

(B) Number of colonies which were obtained after
transfection of 0.1 µg in vitro transcribed RNA
of the construct I389/core-3'/5.1 described under
A into Huh-7 cells. The result of a representative
experiment is given.

(C) Demonstration of autonomously replicating HCV
full-length RNAs in G418 resistant cell clones
which were obtained after transfection of the
corresponding in vitro transcript. The
illustration shows the autoradiogram of a
Northern Blot, which was hybridised with a probe
against the neo-resistance gene and the HCV 5'
NTR. The controls shown in lanes 1 and 2 each
correspond to 10^8 molecules of the indicated in
vitro transcripts, mixed with total RNA from
naive Huh-7 cells. The negative control contains
only total RNA from naive Huh-7 cells (lane 3).
Lanes 4-9 contain 3-10 µg total RNA from G418
resistant cell clones, which were obtained after
transfection by in vitro transcribed
I389/core-3'/5.1 RNA or I389/core-3'/9-13F RNA.
The G418 concentration used for the selection is
indicated in each case. Five of the cell clones
shown contain the highly adapted RNA variant 5.1
(lanes 4-8), one contains the adapted RNA variant
9-13F (lane 9).

Fig. 13: HCV-RNA constructs with a reporter gene. (A)
Bicistronic HCV-RNA constructs. The reporter gene
is translated with the aid of a separate IRES.
(B) Monocistronic HCV-RNA constructs. The
reporter gene product is expressed as fusion
protein with a HCV protein. Both portions are
linked via a recognition sequence for a viral or
cellular protease, which permits a proteolytic
separation of the two fused protein portions. In
the example shown the reporter gene product and
the respective HCV protein were fused through a
recognition sequence for ubiquitin (Ub).

Fig. 14: Tricistronic full-length HCV-RNA construct, that
in addition to the resistance gene possesses an
inserted foreign gene (ther. gene).

Fig. 15: Monocistronic HCV-RNA constructs, for which the
resistance gene product is expressed as a fusion
protein with a HCV portion. The resistance gene
(RG) is either active as a fusion protein or it
is fused with the HCV portion via a
proteolytically cleavable sequence in such a way
that the resistance gene product is split from
the HCV portion by a cellular or viral protease.
In the example shown the resistance gene was
fused with the respective HCV portion through the
sequence coding for ubiquitin (Ub).



Example 1: Production of HCV-RNA constructs

(A) Synthesis and cloning of a complete HCV consensus
genome by means of RT-PCR

The HCV genome, that is the HCV-RNA, was isolated from
the liver of a chronically infected patient as described in
the following:

The total RNA was isolated from approx. 100 mg of liver
according to the method described by Chomczynski and Sacchi
(1987, Anal. Biochem. 162, 156). Using 1 µg of this
isolated RNA a reverse transcription was carried out with
the primers A6103 (GCTATCAGCCGGTTCATCCACTGC) or A9413
(CAGGATGGCCTATTGGCCTGGAG) and the Expand reverse
transcriptase system (Boehringer Mannheim, Germany)
according to the manufacturer's recommendations. A
polymerase chain reaction (PCR) was carried out with the
products of this reverse transcription using the Expand
long template system (Boehringer Mannheim, Germany), in
which the buffer containing 2 % of dimethyl sulfoxide was
used. After one hour at 42 °C, 1/8 of the reaction mixture
was used for a first PCR with primers A6103 and S59
(TGTCTTGACGCAGAAAGCGTCTAG) or A9413 and S4542
(GATGAGCTCGCCGCGAAGCTGTCC). After 40 cycles, 1/10 of this
reaction mixture was used for a second PCR with primers S59
and A4919 (AGCACAGCCCGCGTCATAGCACTCG) or S4542 and A9386
(TTAGCTCCCCGTTCATCGGTTGG). After 30 cycles the PCR products
were purified by means of preparative agarose gel
electrophoresis and the eluted fragments were ligated into
the vector pCR2.1 (Invitrogen) or pBSK II (Stratagene).
Four clones of each fragment were analyzed and sequenced,
and a consensus sequence was established. For this purpose
the DNA sequences were compared to each other. The
positions, where the sequence of one fragment was different
from the others, were considered as undesired mutations. In
the case of ambiguities of the sequence, shorter
overlapping PCR fragments of the respective region were
amplified and several clones sequenced. By this means
several potential mutations could be identified in each
fragment and consequently an isolate specific consensus
sequence could be established. This established consensus
sequence or genome belongs to the worldwide spread genotype
1b. The non-translated region at the 3' end (= 3' NTR) was
obtained by means of a conventional PCR, whereby an
antisense primer, which covers the last 24 nucleotides of
the 'X-tail' known from the state of the art, was used
(Tanaka et al., 1995, Biochem. Biophys. Res. Commun. 215,
744; and Rice, PCT/US 96/14033). The authentic
non-translated region at the 5' end (= 5' NTR) downstream of
the T7 promoter was created by means of PCR, whereby an
oligonucleotide, which corresponds to a shortened T7
promoter (TAA TAC GAC TCA CTA TAG) and the first 88
nucleotides of HCV, was used on one hand and one of the
previously mentioned plasmids, which carries one of the 5'
fragments of the genome, was used on the other hand. A
complete HCV consensus genome was assembled from the
subgenomic fragments with the smallest number of
non-consensus replacements and inserted into a modified
pBR322 vector. Deviations from the consensus sequence were
eliminated by means of site-directed mutagenesis. In order
to produce run-off transcripts with an authentic 3' end,
the 3' NTR of the isolates (with the end TGT) was modified
to AGT (according to the sequence of the genotype 3 = clone
'WS' according to Kolykhalov et al., 1996, J. Virol. 70,
3363) and an additional nucleotide replacement was carried
out at position 9562, to preserve the A:T pairing in the
hairpin structure at the 3' end of the 3' NTR (Kolykhalov et
al. ibid.). In order to eliminate an internal restriction
site for the ScaI enzyme, a silent nucleotide replacement
was further carried out. After joining the full-length
genome with the appropriate 5' and 3' NTRs the complete HCV
sequence was analyzed. No undesired nucleotide replacement
was detected.
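
The comparison of clones described above, in which a position
that differs in one clone only is treated as an undesired
mutation, amounts to a majority vote per position. The
following Python sketch illustrates this under simplifying
assumptions: the clone sequences are already aligned and of
equal length, and a tie is reported as ambiguous (in the
procedure above such regions were re-amplified and several
further clones sequenced). The function name and the toy
sequences are hypothetical.

```python
from collections import Counter


def consensus(clones):
    """Return (consensus, ambiguous_positions) for aligned sequences.

    A position is ambiguous when no single base has a strict majority
    over the next most frequent base; it is written as 'N'.
    """
    if not clones:
        raise ValueError("at least one sequence is required")
    length = len(clones[0])
    if any(len(seq) != length for seq in clones):
        raise ValueError("sequences must be aligned to the same length")

    bases = []
    ambiguous = []
    for pos in range(length):
        counts = Counter(seq[pos] for seq in clones).most_common()
        top_base, top_count = counts[0]
        # A tie for first place cannot be resolved by majority vote
        if len(counts) > 1 and counts[1][1] == top_count:
            ambiguous.append(pos)
            bases.append("N")
        else:
            bases.append(top_base)
    return "".join(bases), ambiguous


# Toy example: four clones, the third carries a single deviation
sequence, unresolved = consensus(["ACGT", "ACGT", "ACTT", "ACGT"])
print(sequence, unresolved)
```

Positions listed as ambiguous would need additional clones
before a consensus base can be assigned.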

The HCV genome produced in this way should be hepatotropic 
according to the definition. 

(B) Synthesis of selectable HCV subgenomic constructs

By means of the consensus genome described under (A), HCV
subgenomic constructs were created, which include the
antibiotic resistance gene neomycin phosphotransferase
(NPT) and two sequences of internal ribosome entry sites
(IRES). The biochemical procedures used for this are known
and familiar to the expert (see: Sambrook, J., E. F.
Fritsch, T. Maniatis, 1989, Molecular cloning: a laboratory
manual, 2nd ed., Cold Spring Harbor Laboratory, Cold
Spring Harbor, N.Y.; Ausubel et al. (eds.), 1994, Current
Protocols in Molecular Biology, Vol. 1-3, John Wiley & Sons
Inc., New York). The antibiotic resistance gene was
inserted immediately after the 5' NTR, obtaining a
bicistronic RNA as a result (see Fig. 1 A). However, the
antibiotic resistance gene might as well be inserted at
another site of the HCV subgenomic construct, for example
within the nucleotide sequence encoding the
polyprotein, obtaining a monocistronic RNA as a result (see
Fig. 5 A), or in the 3' NTR (see Fig. 5 B). The IRES
elements are one of the two HCV-IRES variants from
nucleotides 1-377 or nucleotides 1-389 as well as the IRES
of the encephalomyocarditis virus, which controls the
translation of the HCV sequence downstream from the genes
for NS2 or NS3 up to the authentic 3' end of the genome.
The two HCV-IRES variants mentioned were determined in the
following way:

Based on deletion analyses of the 3' end of the HCV-IRES
(Reynolds et al. 1995, EMBO J. 14, 6010) various segments of
the 5' NTR were fused with the NPT gene and analyzed with
regard to the maximum number of colonies formed by means of
co-transfection with a plasmid containing the T7 RNA
polymerase gene. The best results were achieved with the
HCV sequences from 1-377 to 1-389. Due to the fact that the
AUG start codon of the HCV polyprotein is located at
position 342 and is consequently included in the IRES
sequence, 12 or 16 amino acids of the HCV core protein
("core protein") were fused with the neomycin
phosphotransferase (see Fig. 1 A).
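
The figures of 12 and 16 fused core amino acids follow
directly from the nucleotide positions given above. The short
Python sketch below reproduces the arithmetic, assuming that
the A of the AUG codon sits at position 342 and that the IRES
segment ends with a complete codon; the function name is
hypothetical.

```python
def fused_core_residues(ires_end, aug_position=342):
    """Number of core codons (start codon included) inside the IRES segment."""
    coding_nt = ires_end - aug_position + 1
    if coding_nt <= 0:
        raise ValueError("the segment ends before the start codon")
    if coding_nt % 3 != 0:
        raise ValueError("the segment does not end on a codon boundary")
    return coding_nt // 3


for end in (377, 389):
    print(f"IRES 1-{end}: {fused_core_residues(end)} core amino acids")
```

Nucleotides 342 to 377 span 36 nucleotides (12 codons), and
nucleotides 342 to 389 span 48 nucleotides (16 codons).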

Accordingly, these modified HCV subgenomic constructs were
given the designations I377/NS2-3' (or I377/NS3-3') and
I389/NS2-3' (or I389/NS3-3'). They are illustrated in a
diagram in Fig. 1 A.

Different cell lines and primary cell cultures of human
hepatocytes were transfected with in vitro transcripts of
these modified parental HCV subgenomic constructs
I377/NS2-3' (or I377/NS3-3') and I389/NS2-3' (or I389/NS3-3').

As a parallel negative control for all transfection
experiments, an appropriately modified, but defective
subgenome was constructed for every modified parental HCV
subgenomic construct. It differs from the parental
construct in that it has a deletion of 10 amino acids
within the reading frame comprising the active
site of the NS5B RNA polymerase (Behrens et al., 1996, EMBO
J. 15, 12; and Lohmann et al., 1997, J. Virol. 71, 8416).

(C) Synthesis of selectable HCV genomic constructs

A NS2-3' subgenomic construct, linked at its 5' end to a
fragment of the luciferase gene and the complete EMCV-IRES,
was restricted with NcoI and SpeI and purified using
preparative agarose gel electrophoresis. The vector
obtained in this way was ligated with a NcoI/NotI-HCV
fragment corresponding to the nucleotide positions 342 to
1968 of the HCV genome and a NotI/SpeI fragment
corresponding to the nucleotide positions 1968-9605 in a
3-factor ligation. The resulting construct, where the
complete HCV open reading frame and the 3' NTR lie
downstream from the luciferase gene fragment and the
EMCV-IRES, was then restricted with PmeI and SpeI and ligated
with the analogously restricted I389/NS3-3'/wt subgenomic
construct vector. This selectable HCV genomic construct is
illustrated in Fig. 4.

(D) Production of in vitro transcripts corresponding to the
HCV-RNA constructs

The previously described purified plasmid DNAs were
linearized with ScaI and, after phenol/chloroform extraction
and isopropanol precipitation, used for an in vitro
transcription reaction with the following components: 80 mM
HEPES, pH 7.5, 12.5 mM MgCl2, 2 mM spermidine, 40 mM
dithiothreitol, 2 mM of each NTP, 1 unit RNasin/µl, 50
µg/ml restricted DNA and approx. 2 units/µl T7 RNA
polymerase. After 2 hrs. at 37 °C half of the amount of T7
polymerase was added and the reaction mixture was incubated
for two further hours. In order to remove DNA the mixture
was extracted with acid phenol (U. Kedzierski, J.C. Porte,
1991, BioTechniques 10, 210), precipitated with
isopropanol, the pellet was dissolved in water and
incubated with DNase (2 units per µg DNA) for 60 min. at
37 °C. After subsequent extraction with acid phenol, acid
phenol/chloroform and chloroform as well as isopropanol
precipitation the dissolved RNA was quantified using
optical density measurement and its integrity was checked
using formaldehyde-agarose gel electrophoresis.
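
The final concentrations listed above translate into
pipetting volumes once stock concentrations and a reaction
volume have been chosen (volume = final concentration x
reaction volume / stock concentration). The Python sketch
below shows this arithmetic only. The stock concentrations,
the 100 µl reaction volume and the function name are
illustrative assumptions and are not taken from the
description.

```python
def stock_volumes(final_conc, stock_conc, reaction_volume_ul):
    """Return the volume in µl of each stock needed for one reaction.

    final_conc and stock_conc map component names to concentrations,
    given in the same unit for each component (here mM).
    """
    volumes = {}
    for name, final in final_conc.items():
        stock = stock_conc[name]
        if stock < final:
            raise ValueError(f"stock of {name} is too dilute")
        volumes[name] = final * reaction_volume_ul / stock
    total = sum(volumes.values())
    if total > reaction_volume_ul:
        raise ValueError("stocks are too dilute for this reaction volume")
    # The remainder is left for template DNA, enzymes, RNasin and water
    volumes["remaining volume"] = reaction_volume_ul - total
    return volumes


# Final concentrations in mM, as listed in the description
final = {"HEPES pH 7.5": 80, "MgCl2": 12.5, "spermidine": 2,
         "dithiothreitol": 40, "NTP mix (per NTP)": 2}
# Hypothetical stock concentrations in mM (assumptions for illustration)
stock = {"HEPES pH 7.5": 1000, "MgCl2": 1000, "spermidine": 100,
         "dithiothreitol": 1000, "NTP mix (per NTP)": 25}

for name, volume in stock_volumes(final, stock, 100).items():
    print(f"{name}: {volume:.2f} µl")
```

The enzyme and template amounts (units/µl, µg/ml) scale with
the reaction volume in the same way.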

Example 2: Transfection experiments with the hepatoma cell
line Huh 7

With all transfection experiments it was carefully ensured
that any template DNA had been removed beforehand so as to
avoid the possibility of this DNA integrating in
transfected cells and conferring a neomycin resistance to
them independent from HCV replication. The reaction mixture
was therefore treated with 2 units of DNase per µg DNA for
60 min. at 37 °C after the in vitro transcription and
extracted with acid phenol, acid phenol/chloroform and
chloroform. Prior to being used for the transfection the
precipitated RNA was analyzed using formaldehyde-agarose
gel electrophoresis.

Three separate transfection experiments were carried out
with the highly differentiated human hepatoma cell line Huh
7 (according to Nakabayashi et al. 1982, Cancer Res. 42,
3858). Each time 15 µg RNA were introduced into 8 x 10^6 Huh
7 cells by electroporation and the cells were seeded in
culture dishes with a diameter of 10 cm. 24 hours after
seeding, neomycin (= G418) was added in a final
concentration of 1 mg/ml. The culture medium was changed
twice a week. After 3 - 5 weeks small colonies were
visible, which were isolated and grown under the same
culture conditions.

The cell clones obtained during the first experiment were
isolated and subpassaged. Most of the clones died during
this procedure, and the final yield was only 9 clones from
cells, which had been transfected with the parental HCV
subgenomic constructs and 1 clone (clone 8-1) from cells,
which had been transfected with a defective HCV genomic
construct, namely a defective NS2-3' HCV-RNA. Apart from an
extended doubling time and the occasional occurrence of
irregularly shaped cells, no consistent morphological
differences were found between these 9 cell clones and the
single cell clone (clone 8-1) or the parental Huh 7 cells.
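
To put the electroporation conditions above (15 µg RNA for
8 x 10^6 cells) into perspective, the number of RNA molecules
offered per cell can be estimated from the transcript length.
The Python sketch below assumes a transcript of approx. 7970
nucleotides (the length given for the NS3-3' replicon) and a
commonly used approximation for the molecular weight of
single-stranded RNA (length x 320.5 + 159 g/mol). Both the
approximation and the function name are assumptions that are
not part of the description, and the result says nothing
about how many molecules actually enter a cell.

```python
AVOGADRO = 6.02214076e23  # molecules per mole


def rna_molecules(mass_ug, length_nt, avg_nt_mass=320.5, end_mass=159.0):
    """Estimate the number of single-stranded RNA molecules in a given mass.

    avg_nt_mass and end_mass implement an approximate molecular weight
    for ssRNA (an assumption, not taken from the description).
    """
    molecular_weight = length_nt * avg_nt_mass + end_mass  # g/mol
    moles = mass_ug * 1e-6 / molecular_weight
    return moles * AVOGADRO


molecules = rna_molecules(15, 7970)
per_cell = molecules / 8e6
print(f"{molecules:.2e} molecules in total, about {per_cell:.1e} per cell")
```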

The main criteria for functioning HCV genomic constructs
are the formation of viral RNA with the correct size and
the absence of (integrated) plasmid DNA, which could
confer G418 resistance.

To determine the HCV-RNA in the Huh 7 cells, the total RNA
was isolated and analyzed by means of the common Northern
Blot method using a plus-strand specific riboprobe (RNA
probe). For this purpose the total RNA was isolated from
the respective cell clones according to the method
described by Chomczynski and Sacchi 1987, Anal. Biochem.
162, 156 and 10 µg RNA, which is equivalent to the total
RNA content of 0.5 - 1 x 10^6 cells, are separated using
denaturing formaldehyde-agarose gel electrophoresis (lanes 3
to 12 of Fig. 1 B). At the same time 10^9 in vitro
transcripts (ivtr.), which correspond to the I389/NS2-3'/wt
or the I389/NS3-3'/wt replicon RNAs, are separated as well
as size markers with authentic sequence (lane 1 or lane 2).
The separated RNA was transferred onto nylon-based
membranes and hybridized with a radioactively labeled
plus-strand specific RNA probe, which was complementary to
the complete NPT gene and the HCV-IRES of nucleotide 377 to
nucleotide 1. The positions of the HCV specific RNAs
(arrows) and the 28S rRNA are specified to the right of
lane 12, the size (number of nucleotides) of the RNA marker
is specified to the left of lane 1. The RNA marker fragments
contain HCV sequences and therefore hybridize with the
riboprobe (= RNA probe). The results of this analysis are
illustrated in Fig. 1 B.

With the exception of clone 8-1 transfected with the
defective HCV genomic construct, all cell clones produced
homogenous HCV-RNAs of correct lengths (approx. 8640
nucleotides in the case of the NS2-3' and approx. 7970
nucleotides in the case of the NS3-3' replicon). This result
indicates that the functional replicons or
the functional HCV genomic constructs confer the G418
resistance. In order to exclude that the G418 resistance
is caused by plasmid DNA, which is integrated in the
genome of the Huh 7 host cell and transcribed under control
of a cellular promoter, the DNA of each clone was analyzed
by means of a NPT gene specific PCR. For this purpose the DNA
was isolated from the selected Huh 7 cell clones by means
of digestion with proteinase K (40 µg/ml, 1 h, 37 °C) in
10 mM Tris, pH 7.5, 1 mM EDTA, 0.5 % SDS and subsequent
extraction with phenol, phenol/chloroform and isopropanol
precipitation. The DNA precipitate was dissolved in 10 mM
Tris (pH 7.5) and 1 mM EDTA and incubated with RNase A for
1 hour. Following a phenol/chloroform extraction and
ethanol precipitation, 1 µg DNA, equivalent to 4 - 8 x 10^4
cells, was analyzed by means of PCR using NPT gene specific
primers (5'-TCAAGACCGACCTGTCCGGTGCCC-3' and
5'-CTTGAGCCTGGCGAACAGTTCGGC-3'), and a DNA fragment consisting
of 379 nucleotides was generated. The specificity of the
PCR product was established by means of the Southern Blot
method, in which a DNA fragment labeled by digoxigenin was
used, which corresponds to the NPT gene. As positive
controls (for the detection of possibly contaminating
nucleic acids) the PCR method was carried out with 10^7
plasmid molecules or 1 µg DNA from a BHK cell line, which
was stably transfected with a neomycin resistance gene, and
as negative control, the PCR was carried out with the same
reagents but without added DNA.

The results of this analysis are illustrated in Fig. 1 C.
The lanes 1 and 2 represent the positive controls, lane 13
represents the negative control. The numbers to the left of
lane 1 indicate the sizes of the nucleotide marker
molecules.

NPT-DNA could not be detected in any cell clone, apart
from clone 7-3 (Fig. 1 C, lane 3), which was obtained from
cells after transfection with a NS2-3' replicon/NS2-3' HCV
genomic construct, and clone 8-1 (Fig. 1 C, lane 12), which
was obtained from cells after transfection with a defective
HCV genomic construct. This result was another indication
that the G418 resistance of most clones was
conferred by the replicating HCV-RNA. But even regardless
of these results, it is unlikely that integrated plasmid
DNA produces HCV-RNAs of correct sizes, because the
plasmids used for the in vitro transcription contain
neither a eukaryotic promoter nor a polyadenylation
signal. In the case of clone 7-3 the resistance is
therefore very probably conferred by the HCV-RNA construct
or the replicating HCV-RNA as well as by an integrated NPT
DNA sequence, whereas the resistance of the cells of clone
8-1 is only caused by the integrated plasmid DNA.

Clone 9-13 (Fig. 1 B, lane 11) was subjected to further
tests, to confirm that the G418 resistance was conferred by
a self-replicating HCV-RNA. Clone 8-1, which carries the
integrated copies of the NPT gene, was used throughout as
negative control. A PCR was carried out, which allows the
detection of fewer than 1000 NPT gene copies in approx.
40,000 cells, with the aim to rigorously exclude the
presence of NPT-DNA in clone 9-13. The result of this PCR
is illustrated in Fig. 2 A. The PCR proceeded in detail as
follows:

During the test, 10^6 - 10^2 plasmid molecules
(I377/NS3-3'/wt) were used either directly (lanes 7 - 11) or
after adding 1 µg 9-13 DNA each (lanes 2 - 6). The
specificity of the amplified DNA fragment was determined by
Southern Blot using a NPT specific probe. A PCR without DNA
template was carried out as negative control (lane 12).

Even with this sensitive method, no plasmid DNA could be
detected in 1 µg DNA of the cell clone 9-13 (lane 1). To
estimate the amount of HCV plus- and minus-strand RNAs in
these cells, a dilution series of total RNA was analyzed
with the Northern Blot method using a plus- or minus-strand
specific radioactively labeled riboprobe (= RNA probe).
For this purpose, 8, 4 or 2 µg of total RNA, which have
been isolated from the cell clones 9-13 and 8-1, were
analyzed in parallel to known amounts of in vitro
transcripts with plus- or minus-strand polarity (control
RNAs) in the Northern Blot method and were then subjected
to a hybridization. The hybridization was carried out with
a plus-strand specific riboprobe, which covered the
complete NPT gene and the HCV-IRES ('plus-strand', top
panel), or with a minus-strand specific RNA probe, which
was complementary to the NS3 sequence ('minus-strand',
bottom panel). The arrows mark the positions of replicon
RNA. The results of this analysis are illustrated in Fig.
2 B.

Approx. 10^8 copies/µg total RNA were detected in the case
of the plus-strand, which is equivalent to 1000 - 5000
HCV-RNA molecules per cell, whereas the amount of
minus-strand RNA was 5- to 10-times less. This result
corresponds to the assumption, that the minus-strand RNA is
the replicative intermediate form or intermediate copy,
which is used as a template for the synthesis of the
plus-strand molecules.
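
The conversion from copies per µg of total RNA to copies per
cell can be retraced with the equivalence stated earlier in
this Example (10 µg total RNA corresponds to 0.5 - 1 x 10^6
cells). The Python sketch below is a rough plausibility check
under that assumption only; the function name is
hypothetical.

```python
def copies_per_cell(copies_per_ug, rna_ug, cells):
    """Convert a copy number per µg total RNA into copies per cell.

    rna_ug is the amount of total RNA obtained from the given number
    of cells.
    """
    rna_per_cell_ug = rna_ug / cells
    return copies_per_ug * rna_per_cell_ug


# 10 µg total RNA from 1 x 10^6 cells or from 0.5 x 10^6 cells
low = copies_per_cell(1e8, 10, 1e6)
high = copies_per_cell(1e8, 10, 0.5e6)
print(f"about {low:.0f} to {high:.0f} plus-strand copies per cell")
```

With these inputs the estimate is about 1000 to 2000 copies
per cell, which lies within the range of 1000 - 5000 quoted
above.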

Due to the fact that the reaction is mainly catalyzed by
the viral RNA-dependent RNA polymerase, the synthesis of
the HCV-RNAs should be resistant to dactinomycin, an
antibiotic, which selectively inhibits the RNA synthesis
from DNA templates, but not the RNA synthesis from RNA
templates. To confirm this assumption, cells were incubated
with [3H] uridine in the presence of dactinomycin, the
radioactively labeled RNAs were extracted, separated by
means of denaturing agarose gel electrophoresis and
analyzed with the aid of a common Bio-Imager using a
[3H]-sensitive screen. For this purpose approx. 5 x 10^5
cells of the clones 9-13 and 8-1 were incubated at a time
with 100 µCi [3H] uridine for 16 hrs. in the absence (-) or
presence (+) of 4 µg/ml of dactinomycin (Dact). Following
this labeling reaction the total RNA was prepared and
analyzed by means of formaldehyde-agarose gel
electrophoresis. Only 1/10 of the total RNA is illustrated
in the first two lanes. The radioactively labeled RNA was
visualized using a BAS-2500 Bio-Imager (Fuji).

The results of this analysis are illustrated in Fig. 2 C.
In accordance with the inhibitor profile of the NS5B
polymerase (Behrens et al., 1996, EMBO J. 15, 12 and Lohmann
et al., 1997, J. Virol. 71, 8416), dactinomycin had no
influence on the replication of the HCV RNA, whereas the
synthesis of cellular RNA was inhibited. A RT-PCR was
carried out for recloning the replicating sequences, to
confirm the identity of the viral RNA. The sequence
analysis of the recloned RNA showed that the RNA in clone
9-13 is HCV specific and corresponds to the transfected
transcript of the HCV construct I377/NS3-3'/wt.

For the analysis of the viral proteins, the respective
cells were first metabolically radioactively labeled
with [35S] methionine/cysteine, then lysed, and afterwards
the HCV specific proteins were isolated from the cell
lysates by means of immunoprecipitation. The results of
these analyses are illustrated in Fig. 3 A. The detailed
procedure was as follows: Cells of the cell clones 9-13
(wt) and 8-1 (Δ) were metabolically radioactively labeled
by treating them with a protein labeling mixture familiar
to the expert and available on the market (for example NEN
Life Science), the HCV specific proteins were separated
from the cell lysate by immunoprecipitation (IP) under
non-denaturing conditions (for example according to
Bartenschlager et al., 1995, J. Virol. 69, 7519) using
three different antisera (3/4, 5A, 5B, according to the
labeling on the top end of the lanes 1 to 12). The immune
complexes were analyzed by means of tricine SDS-PAGE and
made visible by means of autoradiography. To obtain
authentic size markers, the homologous replicon construct
I377/NS3-3'/wt was subjected to a transient expression by
the vaccinia virus-T7 hybrid system in the Huh 7 cells. The
resulting products were used as size markers (lanes 7-9)
parallel to the cells of the clones 9-13 and 8-1.
Identified HCV proteins are labeled on the left edge of
lane 1, the molecular weights (in kilodaltons) are specified
on the right edge of lane 9. It should be noted that the
NS3/4 specific antiserum ('3/4') used reacts preferentially
with NS4A and NS4B, causing an underrepresentation of NS3.

All viral antigens could be detected unambiguously, and
their apparent molecular weights did not show any difference
from those detected after transient expression of the same
bicistronic HCV-RNA construct in the original Huh 7 cells.
To determine the subcellular distribution of the viral
antigens, an immunofluorescence detection reaction was
carried out using NS3 and NS5A specific antisera (for
example according to Bartenschlager et al., 1995, J. Virol.
69, 7519). For this purpose cells of the clones 9-13 (wt)
and 8-1 (A) were fixed with methanol/acetone 24 hrs. after
incubation on coverslips and incubated with polyclonal NS3
or NS5A specific antisera. The bound antibodies were made
visible with a commercially available FITC conjugated
anti-rabbit antiserum. The cells were counterstained with
the 'Evans Blue' stain to suppress unspecific fluorescence
signals.

The results of this detection test are illustrated in Fig.
3 B. A strong fluorescence in the cytoplasm could be
detected with both antisera. The NS5A specific antiserum
also caused a slight nuclear fluorescence, which indicates
that at least small amounts of this antigen also reach the
nucleus. But the generally dominating presence of the viral
antigens in the cytoplasm is a strong indication that
HCV-RNA replication occurs in the cytoplasm, as is the case
with most RNA viruses.

These results prove clearly that a cell culture system for
HCV could be established with the test arrangement
described. Its efficiency surpasses by far everything known
up until now and for the first time allows the detection of
viral nucleic acids and proteins with conventional and
approved biochemical methods. This efficiency makes
possible detailed examination of HCV pathogenesis, genetic
analyses of different HCV functions and a precise study of
the virus/host cell interaction, through which new starting
points for the development of an antiviral therapy can be
defined.



Example 3: Transfection of Huh 7 cells with HCV total
           constructs

Huh 7 cells are transfected and selected as described in
Example 2, whereby in this case selectable constructs are
used which contain the complete virus genome. Corresponding
to Example 2, the resulting cell clones are tested for the
absence of HCV-DNA by means of PCR, and the productive
replication of HCV-RNA is then established by means of
Northern Blot, [3H]uridine labeling in the presence of
dactinomycin, and detection of the viral proteins or
antigens, preferably with the aid of Western Blot,
immunoprecipitation or immunofluorescence. In contrast to
the arrangements described in Example 2, the construct
described here makes it possible to obtain complete and
very likely infectious viruses, which has not been the case
with the subgenomic constructs described in Example 2. These
viruses existing in the cell and the cell culture
supernatant are concentrated, for example by means of
ultracentrifugation, immunoprecipitation or precipitation
with polyethylene glycol, and all exogenous nucleic acids,
which means those that are not incorporated into the virus
particle, are digested by incubation with nucleases (RNase,
DNase, micrococcal nuclease). In this way all contaminating
nucleic acids that are not enclosed in the protecting virus
particle can be removed. After inactivation of the
nucleases, the protected viral RNA is isolated, for example
by means of incubation with proteinase K in a buffer
containing SDS followed by extraction with phenol and
phenol/chloroform, and detected by means of Northern Blot or
RT-PCR using HCV specific primers. In this test arrangement,
too, the combination of the HCV consensus genome described
with a selection marker was crucial for the efficient
production of viral RNA, viral protein and therefore HCV
particles.

Example 4: Production and application of a HCV-RNA
           construct, whereby the resistance gene is
           linked to the HCV subgenomic sequence via a
           ribozyme or a recognition site for a ribozyme.

A HCV-RNA construct is produced according to Example 1 or
Example 3, where an antibiotic resistance gene is linked to
the HCV-RNA sequence through a ribozyme or a recognition
sequence for a ribozyme. These constructs are illustrated
in a diagram in Fig. 7. Huh 7 cells are transfected with
this HCV-RNA construct as described in Example 2. The
transfection into the cells is followed by a selection with
the appropriate antibiotic. In the cell clones obtained in
this way, the inserted ribozyme is activated or, in the case
of a construct which carries a recognition sequence for a
ribozyme, the ribozyme is transfected into the cell (for
example by means of transfection of a ribozyme construct or
infection with a viral expression vector into which the
respective ribozyme has been inserted). In both cases the
resistance gene is separated from the HCV-RNA sequence by
the ribozyme-dependent cleavage. The result in the case of
the HCV genome is an authentic HCV genome without a
resistance gene, which can form authentic infectious virus
particles. In the case of the HCV subgenomic constructs, a
HCV replicon without resistance gene is created.

Example 5: Co-transfection of a HCV-RNA construct with a
           separate luciferase transfection construct

A HCV-RNA construct is produced according to Example 1 (A)
or Example 3 or Example 4. At the same time a transfection
construct is produced which comprises the luciferase gene,
whereby this luciferase gene is linked via a first
nucleotide sequence, which encodes a HCV protease (for
example NS3 protease) cleavage site, to a second nucleotide
sequence, which encodes another protein or a part of
another protein. The HCV-RNA construct and the transfection
construct are transfected into any host cells, preferably
hepatoma cells, most preferably Huh 7 cells. This can be
carried out as described in Example 2. The product of the
modified luciferase gene is a luciferase fusion protein in
which the luciferase is inactive due to the fusion with the
foreign part. The fusion protein, which contains a
recognition sequence for a HCV protease, is cleaved in
transfected cells with high HCV replication, and



CA 02303526 2000-03-31 



consequently the active form of the luciferase, which can
be identified through luminometric measurement, is
released. If the replication of the HCV-RNA construct is
inhibited, the fusion protein will not be cleaved and no
active luciferase will be released. The quantitative
determination of the luciferase is therefore a measure of
the replication of the HCV construct. Instead of the
luciferase gene, another reporter gene can just as easily
be used, which is modified in the same way so that its
expression depends on the viral replication, although this
reporter gene is not part of the HCV construct. A cellular
protein which is deactivated or activated by the HCV
proteins or nucleic acid can also be used as a so-called
surrogate marker. The expression or activity of this
surrogate marker is in this case a measure of the
replication of the viral RNA.
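
The use of the luciferase signal as a measure of replication can be
illustrated with a short calculation. The following Python sketch is
only one possible way of evaluating such a readout: the function name,
the background correction and all light-unit values are hypothetical
and are not taken from this document.

```python
def relative_replication(signal, background, untreated_signal):
    """Background-corrected luciferase signal as a fraction of the untreated control.

    The normalisation scheme is an assumption made for illustration only.
    """
    corrected = max(signal - background, 0.0)
    reference = untreated_signal - background
    if reference <= 0:
        raise ValueError("untreated control must exceed background")
    return corrected / reference


# Hypothetical relative light units (RLU), not data from this document.
background_rlu = 200.0      # e.g. cells without cleavable, active luciferase
untreated_rlu = 10_200.0    # replicating HCV construct, no inhibitor
treated_rlu = 1_200.0       # replicating HCV construct plus a test compound

fraction = relative_replication(treated_rlu, background_rlu, untreated_rlu)
print(f"replication relative to control: {fraction:.0%}")
print(f"inhibition: {1 - fraction:.0%}")
```

With these assumed readings the treated culture retains one tenth of
the control signal, which would be read as a ninety percent reduction
in replication.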

Example 6: Production of HCV subgenomic constructs with
           integrated foreign genes to be used as liver
           cell specific vector in gene therapy

These recombinant and selectable HCV subgenomic constructs
are transfected into transcomplementing helper cell lines,
that is, cell lines which express the missing functions
(for example the structural proteins) in an inducible or
constitutive way. Cell clones containing a functional HCV
subgenomic construct can be established through appropriate
selection. The viral structural proteins expressed by the
host cell allow the formation of virus particles, into
which the RNA of the HCV subgenomic constructs is
incorporated. The results are therefore virus-like
particles which contain a HCV subgenomic construct
according to the invention, including the inserted foreign
gene, and which can transmit this to other cells by means
of infection. An example of this construct is illustrated
in Fig. 8.

It is also possible to use the HCV subgenomic construct
with integrated foreign gene directly as an expression
vector. This involves the same method as mentioned
previously, except that cell lines which do not express
transcomplementing factors are transfected. In this case
the HCV construct is used only as an expression vector.

Example 7: Production of cell culture adapted HCV-RNA
           constructs

(A) Method of isolation

The following method was used to determine adaptive
mutations and to produce cell culture adapted HCV-RNA
constructs: cells were transfected with a HCV-RNA construct
as described in Examples 1 and 2 and G418-resistant cell
clones were produced. For the determination of the ability
to replicate (understood in this context to be the number
of G418-resistant cell clones obtained per microgram of
transfected HCV-RNA or HCV-RNA construct), the total RNA
from one of the cell clones, the 9-13 clone (Fig. 1 B,
lane 11), was isolated and the quantity of HCV-RNA
contained within it was determined by Northern Blot as
described in Fig. 2 B. Ten micrograms of the total RNA,
containing approx. 10^9 molecules of HCV-RNA, were then
transfected into naive Huh-7 cells using electroporation
(Fig. 9). In parallel, 10^9 in vitro transcripts of the
analogous neo-HCV-RNA, which had been adjusted with
isolated total RNA from naive Huh-7 cells to a total RNA
quantity of 10 µg, were transfected into naive Huh-7 cells.
After selection with G418, the number of cell colonies,
expressed as 'colony forming units (cfu) per microgram
RNA', was determined in both these cultures. At a
concentration of 500 µg/ml G418 in the selection medium,
the number of colonies obtained with the HCV-RNA contained
in isolated total RNA from clone 9-13 was approx. 100,000
cfu per microgram HCV-RNA. In contrast, only 30 - 50
colonies were obtained with the same quantity of in vitro
transcribed HCV-RNA. This result confirms that the specific
infectivity of the HCV-RNA isolated from the cell clones is
approx. 1,000 - 10,000 times higher than the infectivity of
the analogous in vitro transcripts. The experimental
approach is shown in Fig. 9.
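
The stated factor can be checked with simple arithmetic. The
following Python sketch is illustrative only; it assumes that the 30 -
50 colonies are to be read on the same per-microgram basis as the
100,000 cfu, and the function and variable names are chosen here for
illustration.

```python
def fold_increase(adapted_cfu_per_ug, reference_cfu_per_ug):
    """Return how many times higher the adapted value is than the reference."""
    return adapted_cfu_per_ug / reference_cfu_per_ug


cell_derived = 100_000       # approx. cfu per microgram HCV-RNA from clone 9-13
in_vitro_range = (30, 50)    # colonies obtained with in vitro transcripts

# The smallest factor uses the upper bound of the reference, the largest
# factor uses the lower bound.
lowest = fold_increase(cell_derived, max(in_vitro_range))
highest = fold_increase(cell_derived, min(in_vitro_range))

print(f"fold increase: {lowest:.0f} to {highest:.0f}")
```

Under this reading the factor lies between about 2,000 and about
3,300, which falls within the range of approx. 1,000 - 10,000 given
above.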

With the aid of 'long-distance RT-PCR', the HCV-RNA was
amplified from the total RNA of the 9-13 cells, the PCR
amplificate was cloned and numerous clones were sequenced.
A comparison of the sequences of these recloned RNAs with
the sequence of the RNA originally transfected into the
naive Huh-7 cells showed that the recloned RNAs possessed
numerous amino acid exchanges distributed over the whole
HCV sequence (Fig. 10). SfiI fragments of these recloned
mutants were used to replace the analogous SfiI fragment
of the original replicon construct, and RNAs of the
respective mutants were transfected into naive Huh-7 cells.
After selection with G418 the number of colonies formed
was determined for each HCV-RNA mutant. While only 30 - 50
colonies per microgram RNA were obtained with the parental
RNA, the number of colonies was noticeably higher for two
of the recloned variants (Fig. 10). In the case of the
HCV-RNA constructs 9-13I and 9-13C the specific infectivity
was increased to 100 - 1,000 cfu per microgram RNA, and for
the 9-13F replicon it was 1,000 - 10,000 cfu per microgram
RNA. These results show that the amino acid exchanges in
the analysed NS3-5B regions of the mutants 9-13I, 9-13C and
particularly of 9-13F led to a considerable increase in the
ability to replicate. In contrast, all the other HCV-RNA
constructs (9-13 A, B, G, H and K) were no longer able to
replicate; they thus contained lethal mutations.


In order to answer the question of which of the amino acid
exchanges in the 9-13F construct led to an increase in
replication, the exchanges were introduced separately or in
combination into the parental HCV-RNA construct, and the
corresponding RNAs were transfected into naive Huh-7 cells.
The result of the transfection with these RNAs is
summarised in Table 1. From this it is evident that in the
present example the high ability to replicate is determined
by several mutations. The amino acid exchanges in the
HCV-RNA regions NS5A and NS4B make the greatest
contribution. The single exchanges in the NS3 region also
make a contribution and are perhaps synergistic.
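
The relative contribution of the individual exchanges can be made
explicit by expressing the values of Table 1 as a multiple of the
parental value. The Python sketch below is illustrative only: the use
of the midpoint of each range as a summary figure is an assumption
made here, and the dictionary keys are labels chosen for readability.

```python
# Values transcribed from Table 1 (cfu per microgram RNA, lower and upper bound).
TABLE_1 = {
    "none (parental)": (30, 60),
    "1283 arg->gly (NS3)": (200, 250),
    "1383 glu->ala (NS3)": (30, 60),
    "1577 lys->arg (NS3)": (30, 60),
    "1609 lys->glu (NS3)": (160, 300),
    "all four NS3 exchanges": (360, 420),
    "1936 pro->ser (NS4B)": (1000, 5000),
    "2163 glu->gly (NS5A)": (1000, 5000),
    "2330 lys->glu (NS5A)": (30, 60),
    "2442 ile->val (NS5A)": (30, 60),
    "all together": (5000, 5000),
}


def midpoint(bounds):
    """Midpoint of a (low, high) range; an assumed summary statistic."""
    low, high = bounds
    return (low + high) / 2


baseline = midpoint(TABLE_1["none (parental)"])

# Fold change of each construct relative to the parental construct.
fold = {name: midpoint(bounds) / baseline for name, bounds in TABLE_1.items()}

for name, value in sorted(fold.items(), key=lambda item: item[1], reverse=True):
    print(f"{name:28s} {value:7.1f}x")
```

On this basis the single exchanges at positions 1936 (NS4B) and 2163
(NS5A) each exceed the combined effect of the four NS3 exchanges,
whereas the exchanges at positions 2330 and 2442 do not differ from
the parental construct.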

These results confirm that the G418 selection of the cells
transfected with the neo-HCV-RNA construct enriched those
HCV-RNAs having a noticeably higher ability to replicate.
HCV-RNA constructs with greatly differing replication
efficiencies can be selected using the experimental
approach described here. The higher the concentration of
the antibiotic in the selection medium in or on which the
cells containing the HCV-RNA construct are cultivated for
selection, the higher must be the extent of adaptive
mutations, and hence the replication efficiency, of the
relevant HCV-RNA constructs to allow the cells to grow
under these conditions. If the selections are carried out
using lower antibiotic concentrations, cells whose HCV-RNA
construct shows a comparatively lower replication
efficiency and fewer adaptive mutations can also survive
and multiply.

As has been shown, the 9-13F HCV-RNA construct described so
far, which contains several adaptive mutations, had a
higher replication efficiency than the parental HCV-RNA. In
order to obtain HCV-RNAs with even higher replication in
cell culture, the HCV-RNA contained in the total RNA of a
selected cell clone was passaged several times in naive
Huh-7 cells. The selected cell clone, 5-15, was obtained by
transfection with the HCV-RNA construct I389/NS3-3' (Fig.
1). It largely corresponds to the cell clone 9-13, produced
by transfection with a HCV-RNA construct having a HCV-IRES
shorter by 22 nucleotides (I377/NS3-3'; Fig. 1). Ten
micrograms of total RNA, isolated from cell clone 5-15,
were transfected into naive Huh-7 cells using
electroporation and the cells were subjected to a selection
with 1 mg/ml G418. The total RNA from one of the cell clones
thus produced was again isolated, transfected into naive
Huh-7 cells and selected in the same way. This process was
repeated a total of four times. After the fourth passage
the total RNA was isolated from a cell clone and the
neo-HCV-RNA amplified with the aid of 'long-distance
RT-PCR'. The amplified DNA fragment was digested with the
restriction enzyme SfiI and inserted into the
SfiI-restricted parental construct I389/NS3-3'. Over 100 DNA
clones were obtained altogether and then analysed by means
of restriction digestion. In vitro transcribed RNA of about
80 of these clones was in each case transfected into naive
Huh-7 cells and subjected to a selection with 500 µg/ml
G418. Of the 80 neo-HCV-RNA variants examined, the great
majority proved to be replication defective. However, the
specific infectivity, expressed as 'colony forming units'
per microgram RNA, was noticeably increased in the case of
two mutants, 5.1 and 19 (Table 2). It is thus clear that
several passages of the RNA in cell culture produce
HCV-RNAs whose replication efficiency, due to mutations
('adaptive mutations'), is several orders of magnitude
higher than that of the original RNA cloned from patients.
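
The expression 'several orders of magnitude' can be quantified from
the ranges given in Table 2. The following Python sketch is
illustrative only; the pairing of range bounds to obtain a
conservative and a generous estimate is an assumption made here.

```python
import math

# Specific infectivities from Table 2 (cfu per microgram RNA, lower and upper bound).
TABLE_2 = {
    "wild type": (30, 50),
    "9-13 I": (100, 1_000),
    "9-13 F": (1_000, 10_000),
    "5.1": (50_000, 100_000),
    "19": (50_000, 100_000),
}


def log10_gain(variant, reference="wild type"):
    """Most conservative and most generous log10 gain of variant over reference."""
    v_low, v_high = TABLE_2[variant]
    r_low, r_high = TABLE_2[reference]
    return math.log10(v_low / r_high), math.log10(v_high / r_low)


for name in ("9-13 I", "9-13 F", "5.1", "19"):
    low, high = log10_gain(name)
    print(f"{name:7s} {low:4.1f} to {high:4.1f} orders of magnitude")
```

For the variants 5.1 and 19 this gives a gain of about 3 to 3.5 orders
of magnitude over the wild type construct.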

(B) Modified method

Adaptive mutations produced and identified according to (A)
can be transferred into a HCV-RNA construct with low
ability to replicate. This leads to a huge increase in the
replication of this construct. The increase is so great
that HCV-RNAs transfected into cell culture can be shown to
replicate even in the absence of selection pressure. Fig.
12 shows a comparison of the replication efficiency of
HCV-RNAs which corresponded either to the starting sequence
or to the adapted sequences 9-13F or 5.1. For the purposes
of simple measurement, the neo gene was removed and
replaced by the gene for luciferase. The negative control
used was again a HCV-RNA construct that was replication
defective due to an inactivating mutation in the NS5B RNA
polymerase. As early as 24 hours after transfection a
noticeable difference is evident in luciferase activity
between the defective RNA and the 9-13F or 5.1 constructs,
while hardly any difference could be seen between the
defective RNA (318 DN) and the parental RNA construct (wt)
that possessed no adaptive mutations. During the whole
period of observation, the highest luciferase activity, and
thus the highest replication, was obtained with the 5.1
RNA. These results not only confirm the high replication
efficiency of this RNA, but also show that it is possible
to create a cell culture system with adapted HCV-RNA
constructs for which the presence of a selectable gene is
no longer necessary. A summary of the nucleotide and amino
acid differences between the starting construct and the
mutants 9-13F, 5.1 and 19 is presented in Table 3.

Example 8: Production of cell culture adapted HCV-RNA
           full-length genome

In Examples 1 to 7 a subgenomic HCV-RNA was used which
lacked the whole structural protein region from core up to
p7 or even NS2. It will be shown in this example that it is
possible to make a HCV full-length genome replicate in cell
culture with the aid of an adapted NS3-5B sequence. For
this purpose the SfiI fragment of the highly adapted
HCV-RNA 5.1 produced according to Example 7 is first
transferred into a selectable HCV full-length genome (Fig.
12). This HCV genome was transfected into naive Huh-7 cells
and subjected to selection with various G418
concentrations. Depending on the strength of selection (the
G418 concentration), a varying number of cell clones was
obtained (Fig. 12 B). By contrast, no colonies were
obtained with the parental HCV full-length genome
containing no adaptive mutations, as was also the case for
the negative control, which was replication defective due
to an inactivating mutation in the NS5B RNA polymerase. To
confirm that the resulting cell clones really contained an
autonomously replicating HCV full-length construct, total
RNA from several cell clones was isolated and analysed by
means of the Northern Blot method. The full-length HCV-RNA
was clearly detectable in all cell clones (Fig. 12). It is
thus clearly confirmed that with the aid of cell culture
adapted HCV sequences it is possible to produce a HCV
full-length genome which replicates highly efficiently and
autonomously in a cell line, i.e. adapted HCV full-length
genomes can also be produced with the system of the
invention. Furthermore, as this clone possesses the
complete HCV sequence, i.e. it also encodes the structural
proteins necessary for virus particle formation, it is
possible to produce large quantities of infectious virus
particles in cell cultures with this system. To confirm the
presence of these viruses, cell-free supernatants of cells
carrying a replicating HCV full-length genome are added to
naive Huh-7 cells and the cells thus infected are subjected
to selection with G418. Each cell clone growing under these
conditions originates from an infected cell. The viruses in
the cell culture supernatant of cells possessing a
replicating HCV full-length genome can be enriched and
purified using various methods known in the state of the
art, such as ultracentrifugation or microdialysis. They can
then be used for the infection of naive cells. Using this
method it is clearly demonstrated that cell culture adapted
full-length genomes can be produced with the HCV cell
culture system of the invention. These genomes replicate
with high efficiency in cells and produce infectious
viruses. The latter can be detected by infection of an
experimental animal, preferably a chimpanzee.

Example 9: Production of HCV full-length constructs and
           HCV subgenomic constructs with reporter genes

A HCV-RNA construct is produced in which a reporter gene is
inserted in place of the antibiotic resistance gene (Fig.
13). Replication can thereby be determined through the
quantity or activity of the reporter gene or reporter gene
product. The reporter gene is preferably a gene from the
group consisting of the luciferase genes, the CAT gene
(chloramphenicol acetyl transferase gene), the lacZ gene
(beta-galactosidase gene), the GFP gene (green fluorescent
protein gene), the GUS gene (glucuronidase gene) and the
SEAP gene (secreted alkaline phosphatase gene). This
reporter gene and its products, namely the relevant
reporter proteins, can be detected for example by means of
fluorescence, chemiluminescence, colorimetry or
immunological methods (for example enzyme-linked
immunosorbent assay, ELISA).

The reporter gene can be expressed either from a separate
IRES or in the form of a fusion protein, which is either
active as such or fused with a HCV protein via a
proteolytically cleavable amino acid sequence in such a way
that the reporter is separated from the HCV protein through
cleavage by a cellular or viral (HCV) protease.

Example 10: Production of HCV full-length constructs with
            integrated foreign genes used as liver cell
            specific vectors for gene therapy or as
            expression vectors

The construct (Fig. 14) is transfected into cells and leads
to the formation of HCV virus particles that can be used
for the infection of further cells. Since the virus
particles contain encapsidated RNA with a foreign gene,
they can be used in the infected cells for the production
of the protein encoded by this foreign gene. Cells
transfected with the construct also express the foreign
gene.

Example 11: Production of monocistronic HCV-RNA constructs
            in which the resistance gene product is
            expressed as a fusion protein with the HCV
            portion

For some tests it is an advantage if the HCV-RNA construct
does not possess a heterologous IRES element. Tests of this
type are, for example, the determination of interferon
resistance. If a cell possessing a HCV-RNA construct is
incubated with interferon alpha or beta, a reduction in
replication of the HCV-RNA results. In order to explain the
mechanism of this effect it is necessary for the HCV-RNA
construct not to possess any heterologous IRES, as
otherwise it is not possible to determine whether the
interferon mediated inhibition is due to inhibition of the
HCV replication or inhibition of the heterologous IRES. For
this reason constructs are produced in which the
resistance gene is fused with a HCV protein (Fig. 15).
Either the fusion protein is active as such or the
resistance gene product is linked to a HCV protein via a
proteolytically cleavable amino acid sequence in such a way
that it is separated from the HCV protein by a cellular or
viral (HCV) protease.

Table 1: Specific infectivities (cfu/µg RNA) of HCV RNA
constructs with adaptive mutations found with the 9-13F
mutant introduced into the parental construct I389/NS3-3'/wt

amino acid exchange (1)          HCV protein   cfu/µg RNA (2)
none                                           30 - 60
1283 arg -> gly                  NS3           200 - 250
1383 glu -> ala                  NS3           30 - 60
1577 lys -> arg                  NS3           30 - 60
1609 lys -> glu                  NS3           160 - 300
(1283 arg -> gly + 1383 glu ->   NS3           360 - 420
ala + 1577 lys -> arg +
1609 lys -> glu)
1936 pro -> ser                  NS4B          1000 - 5000
2163 glu -> gly                  NS5A          1000 - 5000
2330 lys -> glu                  NS5A          30 - 60
2442 ile -> val                  NS5A          30 - 60
all together                                   5000

(1) Amino acid change in the polyprotein of the HCV isolate
con1 (EMBL gene bank No. AJ238799); amino acids are given
in three-letter code.

(2) Colony forming units (number of cell clones) obtained
with a selection of 500 µg/ml G418.


Table 2: Specific infectivities (cfu/µg RNA) of the
parental HCV RNA construct I389/NS3-3'/wt and the variants
9-13I, 9-13F, 5.1 and 19

Transfected RNA variant   cfu/µg RNA (1)
wild type                 30 - 50
9-13 I                    100 - 1,000
9-13 F                    1,000 - 10,000
5.1                       50,000 - 100,000
19                        50,000 - 100,000

(1) Colony forming units (number of cell clones) obtained
with a selection of 500 µg/ml G418.


Table 3: Nucleotide and amino acid sequence differences
between the parental HCV RNA construct I389/NS3-3'/wt and
the mutants 9-13I, 9-13F, 5.1 and 19



• Gloiio 






aa - exchange 


Q "1 "3 T 




O V. T1 

C > T 


P > L 






t 1 


T > M 












ft 4ft6 


P T 




......... . . .. . ,. 


flft91 


\3 -> t\ 


W > stop 


• - ■ • 


PQQ1 
O y i7 J. 


*>% 

I* ^ *3 


K > G 




U O 


n n. 








*Tl >v O 


F > S 






t ^ r* 


V > A 


-7 1 J Jt 


JODD 


n v m . 

L > T 




•• " • v.- ■ 


H ±0 O 


A > G 


R > G 




d 4 ft Q 
4 4 O :7 


a s. c 


E > A 




4 ODZ 


Vj > A 






A Oft ^ 


t n. r* 






JU t x 


a *s r* 


W X. T3 

K > K 




5166 


A > n, 

i\ \J . 


^ c* 
I\ ^ c* 




6147 




D Q 




6829 


A ^ C 
/I / o 


Ej ^ (a 




7329 
* «j 


A *S CI 


V "S. IP 

I\ £j 




7664 


a v 

r\ ^ Vj 


1 > V 




8486 


C > T 






8991 


C > G 


R > G 


Clone    nt position   nt exchange   aa exchange
NK5.1    4180          C > T         T > I
         4679          C > T         -
         4682          T > C         -
         5610          C > A         L > I
         6437          A > G         -
         6666          A > G         N > D
         6842          C > T         -
         6926          C > T         -
         6930          T > C         S > P
         7320          C > T         P > S
         7389          A > G         K > E
NK19     3946          A > G         E > G
         4078          C > G         A > G
         4180          C > T         T > I
         4682          T > C         -
         5610          C > A         L > I
         5958          A > T         M > L
         6170          T > A         -
         6596          G > A         -
         6598          C > G         A > G
         6833          C > T         -
         6842          G > T         -
         6930          T > C         S > P
         7141          A > G         E > G
         7320          C > T         P > S
         7389          A > G         K > E
         7735          G > A         S > N

Given are the differences between the nucleotide and amino
acid sequences of the parental HCV RNA sequence con1
(EMBL gene bank No. AJ238799) and those of the cell culture
adapted HCV RNAs. Numbers refer to the nucleotide and amino
acid positions of the con1 isolate. nt, nucleotide; aa,
amino acid.
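
The entries of Table 3 for the clones NK5.1 and NK19 can be compared
programmatically. The Python sketch below is illustrative only. It
assumes that a missing or dashed entry in the aa exchange column means
that the nucleotide exchange does not alter the amino acid, and the
helper names are chosen here for illustration.

```python
# (nt position, nt exchange, aa exchange or None if no amino acid change is listed)
NK5_1 = [
    (4180, "C>T", "T>I"), (4679, "C>T", None), (4682, "T>C", None),
    (5610, "C>A", "L>I"), (6437, "A>G", None), (6666, "A>G", "N>D"),
    (6842, "C>T", None), (6926, "C>T", None), (6930, "T>C", "S>P"),
    (7320, "C>T", "P>S"), (7389, "A>G", "K>E"),
]
NK19 = [
    (3946, "A>G", "E>G"), (4078, "C>G", "A>G"), (4180, "C>T", "T>I"),
    (4682, "T>C", None), (5610, "C>A", "L>I"), (5958, "A>T", "M>L"),
    (6170, "T>A", None), (6596, "G>A", None), (6598, "C>G", "A>G"),
    (6833, "C>T", None), (6842, "G>T", None), (6930, "T>C", "S>P"),
    (7141, "A>G", "E>G"), (7320, "C>T", "P>S"), (7389, "A>G", "K>E"),
    (7735, "G>A", "S>N"),
]


def coding_changes(rows):
    """Rows for which an amino acid exchange is listed."""
    return [row for row in rows if row[2] is not None]


def shared(rows_a, rows_b):
    """Changes present in both clones with identical position and exchanges."""
    return sorted(set(rows_a) & set(rows_b), key=lambda row: row[0])


print(len(coding_changes(NK5_1)), "of", len(NK5_1), "changes in NK5.1 alter an amino acid")
print(len(coding_changes(NK19)), "of", len(NK19), "changes in NK19 alter an amino acid")
print("shared positions:", [row[0] for row in shared(NK5_1, NK19)])
```

Position 6842 is listed for both clones but with different nucleotide
exchanges, so it is not counted as shared by this comparison.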

SEQUENCE LISTING

<110> Bartenschlager, Ralf

<120> Hepatitis C Virus cell culture system

<130> ba-1

<140> 199 15 178.4
<141> 1999-04-03

<160> 11

<170> PatentIn Ver. 2.1

<210> 1
<211> 11076
<212> DNA
<213> Hepatitis C Virus

<400> 1

gccagccccc gattggg^gc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaa^cgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 420 
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 480 
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 540 
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 600 
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 660 
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 720 
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 780 
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 840 
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 900 
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 960 
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 1020 
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 1080 
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 1140 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 1200 
cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 1260 
aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 1320 
tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 1380 
acgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 1440 
gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 1500 
tgcaggcagc ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta 1560
taagatacac ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg 1620 
gaaagagtca aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 1680 
gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 1740 
tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1800 
cacgataata ccatgggcac gaatcctaaa cctcaaagaa aaaccaaacg taacaccaac 1860 
cgccgcccac agcfacgtcaa gttcccgggc ggtggtcaga tcgtcggtgg agtttacctg 1920 
ttgccgcgca ggggccccag gttgggtgtg cgcgcgacta ggaagacttc cgagcggtcg 1980 
caacctcgtg gaaggcgaca acctatcccc aaggctcgcc agcccgaggg tagggcctgg 2040 
gctcagcccg ggtacccctg gcccctctat ggcaatgagg gcttggggtg ggcaggatgg 2100 
ctcctgtcac cccgtggctc tcggcctagt tggggcccca cggacccccg gcgtaggtcg 2160 
cgcaatttgg gtaaggtcat cgataccctc acgtgcggct tcgccgatct catggggtac 2220 
attccgctcg tcggcgcccc cctagggggc gctgccaggg ccctggcgca tggcgtccgg 2280 
gttctggagg acggcgtgaa ctatgcaaca gggaatctgc ccggttgctc cttttctatc 2340 
ttccttttgg ctttgctgtc ctgtttgacc atcccagctt ccgcttatga agtgcgcaac 2400 
gtatccggag tgtaccatgt cacgaacgac tgctccaacg caagcattgt gtatigaggca 2460
gcggacatga tcatgcatac ccccgggtgc gtgccctgcg ttcgggagaa caactcctcc 2520 
cgctgctggg tagcgctcac tcccacgctc gcggccagga acgctagcgt ccccactacg 2580 
acgatacgac gccatgtcga tttgctcgtt ggggcggctg ctctctgctc cgctatgtac 2640 
gtgggagatc tctgcggatc tgttttcctc gtcgcccagc tgttcacctt ctcgcctcgc 2700 
cggcacgaga cagtacagga ctgcaattgc tcaatatatc ccggccacgt gacaggtcac 2760 
cgtatggctt gggatatgat gatgaactgg tcacctacag cagccctagt ggtatcgcag 2820
ttactccgga tcccacaagc tgtcgtggat atggtggcgg gggcccattg gggagtccta 2880 
gcgggccttg cctactattc catggtgggg aactgggcta aggttctgat tgtgatgcta 2940 
ctctttgccg gcgttgacgg gggaacctat gtgacagggg ggacgatggc caaaaacacc 3000 
ctcgggatta cgtccctctt ttcacccggg tcatcccaga aaatccagct tgtaaacacc 3060 
aacggcagct ggcacatcaa caggactgcc ctgaactgca atgactccct caacactggg 3120 
ttccttgctg cgctgttcta cgtgcacaag ttcaactcat ctggatgccc agagcgcatg 3180
gccagctgca gccccatcga cgcgttcgct caggggtggg ggcccatcac ttacaatgag 3240 
tcacacagct cggaccagag gccttattgt tggcactacg caccccggcc gtgcggtatc 3300 
gtacccgcgg cgcaggtgtg tggtccagtg tactgcttca ccccaagccc tgtcgtggtg 3360
gggacgaccg accggttcgg cgtccctacg tacagttggg gggagaatga gacggacgtg 3420 
ctgcttctta acaacacgcg gccgccgcaa ggcaactggt ttggctgtac atggatgaat 3480 
agcactgggt tcaccaagac gtgcgggggc cccccgtgta acatcggggg gatcggcaat 3540
aaaaccttga cctgccccac ggactgcttc cggaagcacc ccgaggccac ttacaccaag 3600 
tgtggttcgg ggccttggtt gacacccaga tgcttggtcc actacccata caggctttgg 3660 
cactacccct gcactgtcaa ctttaccatc ttcaaggtta ggatgtacgt ggggggagtg 3720 
gagcacaggc tcgaagccgc atgcaattgg actcgaggag agcgttgtaa cctggaggac 3780
agggacagat cagagcttag cccgctgctg ctgtctacaa cggagtggca ggtattgccc 3840
tgttccttca ccaccctacc ggctctgtcc actggtttga tccatctcca tcagaacgtc 3900 
gtggacgtac aatacctgta cggtataggg tcggcggttg tctcctttgc aatcaaatgg 3960 
gagtatgtcc tgttgctctt ccttcttctg gcggacgcgc gcgtctgtgc ctgcttgtgg 4020 
atgatgctgc tgatagctca agctgaggcc gccctagaga acctggtggt cctcaacgcg 4080
gcatccgtgg ccggggcgca tggcattctc tccttcctcg tgttcttctg tgctgcctgg 4140 
tacatcaagg gcaggctggt ccctggggcg gcatatgccc tctacggcgt atggccgcta 4200 
ctcctgctcc tgctggcgtt accaccacga gcatacgcca tggaccggga gatggcagca 4260 
tcgtgcggag gcgcggtttt cgtaggtctg atactcttga ccttgtcacc gcactataag 4320 
ctgttcctcg ctaggctcat atggtggtta caatatttta tcaccagggc cgaggcacac 4380
ttgcaagtgt ggatcccccc cctcaacgtt cgggggggcc gcgatgccgt catcctcctc 4440
acgtgcgcga tccacccaga gctaatcttt accatcacca aaatcttgct cgccatactc 4500 
ggtccactca tggtgctcca ggctggtata accaaagtgc cgtacttcgt gcgcgcacac 4560
gggctcattc gtgcatgcat gctggtgcgg aaggttgctg ggggtcatta tgtccaaatg 4620 
gctctcatga agttggccgc actgacaggt acgtacgttt atgaccatct caccccactg 4680 
cgggactggg cccacgcggg cctacgagac cttgcggtgg cagttgagcc cgtcgtcttc 4740 
tctgatatgg agaccaaggt tatcacctgg ggggcagaca ccgcggcgtg tggggacatc 4800
atcttgggcc tgcccgtctc cgcccgcagg gggagggaga tacatctggg accggcagac 4860 
agccttgaag ggcaggggtg gcgactcctc gcgcctatta cggcctactc ccaacagacg 4920
cgaggcctac ttggctgcat catcactagc ctcacaggcc gggacaggaa ccaggtcgag 4980 
ggggaggtcc aagtggtctc caccgcaaca caatctttcc tggcgacctg cgtcaatggc 5040 
gtgtgttgga ctgtctatca tggtgccggc tcaaagaccc ttgccggccc aaagggccca 5100
atcacccaaa tgtacaccaa tgtggaccag gacctcgtcg gctggcaagc gccccccggg 5160 
gcgcgttcct tgacaccatg cacctgcggc agctcggacc tttacttggt cacgaggcat 5220 
gccgatgtca ttccggtgcg ccggcggggc gacagcaggg ggagcctact ctcccccagg 5280 
cccgtctcct acttgaaggg ctcttcgggc ggtccactgc tctgcccctc ggggcacgct 5340
gtgggcatct ttcgggctgc cgtgtgcacc cgaggggttg cgaaggcggt ggactttgta 5400 
cccgtcgagt ctatggaaac cactatgcgg tccccggtct tcacggacaa ctcgtcccct 5460 
ccggccgtac cgcagacatt ccaggtggcc catctacacg cccctactgg tagcggcaag 5520 
agcactaagg tgccggctgc gtatgcagcc caagggtata aggtgcttgt cctgaacccg 5580
tccgtcgccg ccaccctagg tttcggggcg tatatgtcta aggcacatgg tatcgaccct 5640 
aacatcagaa ccggggtaag gaccatcacc acgggtgccc ccatcacgta ctccacctat 5700 
ggcaagtttc ttgccgacgg tggttgctct gggggcgcct atgacatcat aatatgtgat 5760 
gagtgccact caactgactc gaccactatc ctgggcatcg gcacagtcct ggaccaagcg 5820 
gagacggctg gagcgcgact cgtcgtgctc gccaccgcta cgcctccggg atcggtcacc 5880 
gtgccacatc caaacatcga ggaggtggct ctgtccagca ctggagaaat ccccttttat 5940
ggcaaagcca tccccatcga gaccatcaag ggggggaggc acctcatttt ctgccattcc 6000 
aagaagaaat gtgatgagct cgccgcgaag ctgtccggcc tcggactcaa tgctgtagca 6060 
tattaccggg gccttgatgt atccgtcata ccaactagcg gagacgtcat tgtcgtagca 6120 
acggacgctc taatgacggg ctttaccggc gatttcgact cagtgatcga ctgcaataca 6180 
tgtgtcaccc agacagtcga cttcagcctg gacccgacct tcaccattga gacgacgacc 6240 
gtgccacaag acgcggtgtc acgctcgcag cggcgaggca ggactggtag gggcaggatg 6300 
ggcatttaca ggtttgtgac tccaggagaa cggccctcgg gcatgttcga ttcctcggtt 6360 
ctgtgcgagt gctatgacgc gggctgtgct tggtacgagc tcacgcccgc cgagacctca 6420 
gttaggttgc gggcttacct aaacacacca gggttgcccg tctgccagga ccatctggag 6480 
ttctgggaga gcgtctttac aggcctcacc cacatagacg cccatttctt gtcccagact 6540 
aagcaggcag gagacaactt cccctacctg gtagcatacc aggctacggt gtgcgccagg 6600 
gctcaggctc cacctccatc gtgggaccaa atgtggaagt gtctcatacg gctaaagcct 6660 
acgctgcacg ggccaacgcc cctgctgtat aggctgggag ccgttcaaaa cgaggttact 6720 
accacacacc ccataaccaa atacatcatg gcatgcatgt cggctgacct ggaggtcgtc 6780 
acgagcacct gggtgctggt aggcggagtc ctagcagctc tggccgcgta ttgcctgaca 6840
acaggcagcg tggtcattgt gggcaggatc atcttgtccg gaaagccggc catcattccc 6900 
gacagggaag tcctttaccg ggagttcgat gagatggaag agtgcgcctc acacctccct 6960 
tacatcgaac agggaatgca gctcgccgaa caattcaaac agaaggcaat cgggttgctg 7020 
caaacagcca ccaagcaagc ggaggctgct gctcccgtgg tggaatccaa gtggcggacc 7080 
ctcgaagcct tctgggcgaa gcatatgtgg aatttcatca gcgggataca atatttagca 7140 
ggcttgtcca ctctgcctgg caaccccgcg atagcatcac tgatggcatt cacagcctct 7200 
atcaccagcc cgctcaccac ccaacatacc ctcctgttta acatcctggg gggatgggtg 7260
gccgcccaac ttgctcctcc cagcgctgct tctgctttcg taggcgccgg catcgctgga 7320 
gcggctgttg gcagcatagg ccttgggaag gtgcttgtgg atattttggc aggttatgga 7380
gcaggggtgg caggcgcgct cgtggccttt aaggtcatga gcggcgagat gccctccacc 7440 
gaggacctgg ttaacctact ccctgctatc ctctcccctg gcgccctagt cgtcggggtc 7500 
gtgtgcgcag cgatactgcg tcggcacgtg ggcccagggg agggggctgt gcagtggatg 7560 
aaccggctga tagcgttcgc ttcgcggggt aaccacgtct cccccacgca ctatgtgcct 7620
gagagcgacg ctgcagcacg tgtcactcag atcctctcta gtcttaccat cactcagctg 7680 
ctgaagaggc ttcaccagtg gatcaacgag gactgctcca cgccatgctc cggctcgtgg 7740 
ctaagagatg tttgggattg gatatgcacg gtgttgactg atttcaagac ctggctccag 7800 
tccaagctcc tgccgcgatt gccgggagtc cccttcttct catgtcaacg tgggtacaag 7860 
ggagtctggc ggggcgacgg catcatgcaa accacctgcc catgtggagc acagatcacc 7920 
ggacatgtga aaaacggttc catgaggatc gtggggccta ggacctgtag taacacgtgg 7980
catggaacat tccccattaa cgcgtacacc acgggcccct gcacgccctc cccggcgcca 8040 
aattattcta gggcgctgtg gcgggtggct gctgaggagt acgtggaggt tacgcgggtg 8100 
ggggatttcc actacgtgac gggcatgacc actgacaacg taaagtgccc gtgtcaggtt 8160
ccggcccccg aattcttcac agaagtggat ggggtgcggt tgcacaggta cgctccagcg 8220 
tgcaaacccc tcctacggga ggaggtcaca ttcctggtcg ggctcaatca atacctggtt 8280
gggtcacagc tcccatgcga gcccgaaccg gacgtagcag tgctcacttc catgctcacc 8340 
gacccctccc acattacggc ggagacggct aagcgtaggc tggccagggg atctcccccc 8400 
tccttggcca gctcatcagc tagccagctg tctgcgcctt ccttgaaggc aacatgcact 8460 
acccgtcatg actccccgga cgctgacctc atcgaggcca acctcctgtg gcggcaggag 8520 
atgggcggga acatcacccg cgtggagtca gaaaataagg tagtaatttt ggactctttc 8580
gagccgctcc aagcggagga ggatgagagg gaagtatccg ttccggcgga gatcctgcgg 8640
aggtccagga aattccctcg agcgatgccc atatgggcac gcccggatta caaccctcca 8700 
ctgttagagt cctggaagga cccggactac gtccctccag tggtacacgg gtgtccattg 8760 
ccgcctgcca aggcccctcc gataccacct ccacggagga agaggacggt tgtcctgtca 8820 
gaatctaccg tgtcttctgc cttggcggag ctcgccacaa agaccttcgg cagctccgaa 8880
tcgtcggccg tcgacagcgg cacggcaacg gcctctcctg accagccctc cgacgacggc 8940 
gacgcgggat ccgacgttga gtcgtactcc tccatgcccc cccttgaggg ggagccgggg 9000
gatcccgatc tcagcgacgg gtcttggtct accgtaagcg aggaggctag tgaggacgtc 9060
gtctgctgct cgatgtccta cacatggaca ggcgccctga tcacgccatg cgctgcggag 9120 
gaaaccaagc tgcccatcaa tgcactgagc aactctttgc tccgtcacca caacttggtc 9180 
tatgctacaa catctcgcag cgcaagcctg cggcagaaga aggtcacctt tgacagactg 9240 
caggtcctgg acgaccacta ccgggacgtg ctcaaggaga tgaaggcgaa ggcgtccaca 9300 
gttaaggcta aacttctatc cgtggaggaa gcctgtaagc tgacgccccc acattcggcc 9360 
agatctaaat ttggctatgg ggcaaaggac gtccggaacc tatccagcaa ggccgttaac 9420 
cacatccgct ccgtgtggaa ggacttgctg gaagacactg agacaccaat tgacaccacc 9480
atcatggcaa aaaatgaggt tttctgcgtc caaccagaga aggggggccg caagccagct 9540 
cgccttatcg tattcccaga tttgggggtt cgtgtgtgcg agaaaatggc cctttacgat 9600
gtggtctcca ccctccctca ggccgtgatg ggctcttcat acggattcca atactctcct 9660 
ggacagcggg tcgagttcct ggtgaatgcc tggaaagcga agaaatgccc tatgggcttc 9720 
gcatatgaca cccgctgttt tgactcaacg gtcactgaga atgacatccg tgttgaggag 9780 
tcaatctacc aatgttgtga cttggccccc gaagccagac aggccataag gtcgctcaca 9840 
gagcggcttt acatcggggg ccccctgact aattctaaag ggcagaactg cggctatcgc 9900 
cggtgccgcg cgagcggtgt actgacgacc agctgcggta ataccctcac atgttacttg 9960 
aaggccgctg cggcctgtcg agctgcgaag ctccaggact gcacgatgct cgtatgcgga 10020
gacgaccttg tcgttatctg tgaaagcgcg gggacccaag aggacgaggc gagcctacgg 10080
gccttcacgg aggctatgac tagatactct gccccccctg gggacccgcc caaaccagaa 10140 
tacgacttgg agttgataac atcatgctcc tccaatgtgt cagtcgcgca cgatgcatct 10200 
ggcaaaaggg tgtactatct cacccgtgac cccaccaccc cccttgcgcg ggctgcgtgg 10260 
gagacagcta gacacactcc agtcaattcc tggctaggca acatcatcat gtatgcgccc 10320 
accttgtggg caaggatgat cctgatgact catttcttct ccatccttct agctcaggaa 10380 
caacttgaaa aagccctaga ttgtcagatc tacggggcct gttactccat tgagccactt 10440 
gacctacctc agatcattca acgactccat ggccttagcg cattttcact ccatagttac 10500 
tctccaggtg agatcaatag ggtggcttca tgcctcagga aacttggggt accgcccttg 10560
cgagtctgga gacatcgggc cagaagtgtc cgcgctaggc tactgtccca gggggggagg 10620 
gctgccactt gtggcaagta cctcttcaac tgggcagtaa ggaccaagct caaactcact 10680 
ccaatcccgg ctgcgtccca gttggattta tccagctggt tcgttgctgg ttacagcggg 10740 
ggagacatat atcacagcct gtctcgtgcc cgaccccgct ggttcatgtg gtgcctactc 10800 
ctactttctg taggggtagg catctatcta ctccccaacc gatgaacggg gagctaaaca 10860
ctccaggcca ataggccatc ctgttttttt cccttttttt ttttcttttt tttttttttt 10920
tttttttttt tttttttttc tccttttttt ttcctctttt tttccttttc tttcctttgg 10980
tggctccatc ttagccctag tcacggctag ctgtgaaagg tccgtgagcc gcttgactgc 11040
agagagtgct gatactggcc tctctgcaga tcaagt 11076
<210> 2
<211> 8637 
<212> DNA 

<213> Hepatitis C Virus
<400> 2 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420 
cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 480
ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540 
acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600
cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660
tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720
aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780
cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 
ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900
ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 960
gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 
tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 
ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 
agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200
gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260
cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 
ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380
aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440
gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500
aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560
gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620
tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680
atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 
aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1800
atggaccggg agatggcagc atcgtgcgga ggcgcggttt tcgtaggtct gatactcttg 1860
accttgtcac cgcactataa gctgttcctc gctaggctca tatggtggtt acaatatttt 1920 
atcaccaggg ccgaggcaca cttgcaagtg tggatccccc ccctcaacgt tcgggggggc 1980 
cgcgatgccg tcatcctcct cacgtgcgcg atccacccag agctaatctt taccatcacc 2040
aaaatcttgc tcgccatact cggtccactc atggtgctcc aggctggtat aaccaaagtg 2100 
ccgtacttcg tgcgcgcaca cgggctcatt cgtgcatgca tgctggtgcg gaaggttgct 2160 
gggggtcatt atgtccaaat ggctctcatg aagttggccg cactgacagg tacgtacgtt 2220
tatgaccatc tcaccccact gcgggactgg gcccacgcgg gcctacgaga ccttgcggtg 2280 
gcagttgagc ccgtcgtctt ctctgatatg gagaccaagg ttatcacctg gggggcagac 2340 
accgcggcgt gtggggacat catcttgggc ctgcccgtct ccgcccgcag ggggagggag 2400
atacatctgg gaccggcaga cagccttgaa gggcaggggt ggcgactcct cgcgcctatt 2460 
acggcctact cccaacagac gcgaggccta cttggctgca tcatcactag cctcacaggc 2520 
cgggacagga accaggtcga gggggaggtc caagtggtct ccaccgcaac acaatctttc 2580
ctggcgacct gcgtcaatgg cgtgtgttgg actgtctatc atggtgccgg ctcaaagacc 2640 
cttgccggcc caaagggccc aatcacccaa atgtacacca atgtggacca ggacctcgtc 2700
ggctggcaag cgccccccgg ggcgcgttcc ttgacaccat gcacctgcgg cagctcggac 2760
ctttacttgg tcacgaggca tgccgatgtc attccggtgc gccggcgggg cgacagcagg 2820 
gggagcctac tctcccccag gcccgtctcc tacttgaagg gctcttcggg cggtccactg 2880 
ctctgcccct cggggcacgc tgtgggcatc tttcgggctg ccgtgtgcac ccgaggggtt 2940 
gcgaaggcgg tggactttgt acccgtcgag tctatggaaa ccactatgcg gtccccggtc 3000 
ttcacggaca actcgtcccc tccggccgta ccgcagacat tccaggtggc ccatctacac 3060 
gcccctactg gtagcggcaa gagcactaag gtgccggctg cgtatgcagc ccaagggtat 3120 
aaggtgcttg tcctgaaccc gtccgtcgcc gccaccctag gtttcggggc gtatatgtct 3180 
aaggcacatg gtatcgaccc taacatcaga accggggtaa ggaccatcac cacgggtgcc 3240
cccatcacgt actccaccta tggcaagttt cttgccgacg gtggttgctc tgggggcgcc 3300 
tatgacatca taatatgtga tgagtgccac tcaactgact cgaccactat cctgggcatc 3360 
ggcacagtcc tggaccaagc ggagacggct ggagcgcgac tcgtcgtgct cgccaccgct 3420 
acgcctccgg gatcggtcac cgtgccacat ccaaacatcg aggaggtggc tctgtccagc 3480 
actggagaaa tcccctttta tggcaaagcc atccccatcg agaccatcaa gggggggagg 3540 
cacctcattt tctgccattc caagaagaaa tgtgatgagc tcgccgcgaa gctgtccggc 3600 
ctcggactca atgctgtagc atattaccgg ggccttgatg tatccgtcat accaactagc 3660
ggagacgtca ttgtcgtagc aacggacgct ctaatgacgg gctttaccgg cgatttcgac 3720 
tcagtgatcg actgcaatac atgtgtcacc cagacagtcg acttcagcct ggacccgacc 3780 
ttcaccattg agacgacgac cgtgccacaa gacgcggtgt cacgctcgca gcggcgaggc 3840 
aggactggta ggggcaggat gggcatttac aggtttgtga ctccaggaga acggccctcg 3900 
ggcatgttcg attcctcggt tctgtgcgag tgctatgacg cgggctgtgc ttggtacgag 3960 
ctcacgcccg ccgagacctc agttaggttg cgggcttacc taaacacacc agggttgccc 4020
gtctgccagg accatctgga gttctgggag agcgtcttta caggcctcac ccacatagac 4080
gcccatttct tgtcccagac taagcaggca ggagacaact tcccctacct ggtagcatac 4140 
caggctacgg tgtgcgccag ggctcaggct ccacctccat cgtgggacca aatgtggaag 4200 
tgtctcatac ggctaaagcc tacgctgcac gggccaacgc ccctgctgta taggctggga 4260
gccgttcaaa acgaggttac taccacacac cccataacca aatacatcat ggcatgcatg 4320 
tcggctgacc tggaggtcgt cacgagcacc tgggtgctgg taggcggagt cctagcagct 4380
ctggccgcgt attgcctgac aacaggcagc gtggtcattg tgggcaggat catcttgtcc 4440
ggaaagccgg ccatcattcc cgacagggaa gtcctttacc gggagttcga tgagatggaa 4500
gagtgcgcct cacacctccc ttacatcgaa cagggaatgc agctcgccga acaattcaaa 4560 
cagaaggcaa tcgggttgct gcaaacagcc accaagcaag cggaggctgc tgctcccgtg 4620 
gtggaatcca agtggcggac cctcgaagcc ttctgggcga agcatatgtg gaatttcatc 4680 
agcgggatac aatatttagc aggcttgtcc actctgcctg gcaaccccgc gatagcatca 4740 
ctgatggcat tcacagcctc tatcactagc ccgctcacca cccaacatac cctcctgttt 4800
aacatcctgg ggggatgggt ggccgcccaa cttgctcctc ccagcgctgc ttctgctttc 4860
gtaggcgccg gcatcgctgg agcggctgtt ggcagcatag gccttgggaa ggtgcttgtg 4920 
gatattttgg caggttatgg agcaggggtg gcaggcgcgc tcgtggcctt taaggtcatg 4980 
agcggcgaga tgccctccac cgaggacctg gttaacctac tccctgctat cctctcccct 5040 
ggcgccctag tcgtcggggt cgtgtgcgca gcgatactgc gtcggcacgt gggcccaggg 5100 
gagggggctg tgcagtggat gaaccggctg atagcgttcg cttcgcgggg taaccacgtc 5160 
tcccccacgc actatgtgcc tgagagcgac gctgcagcac gtgtcactca gatcctctct 5220
agtcttacca tcactcagct gctgaagagg cttcaccagt ggatcaacga ggactgctcc 5280 
acgccatgct ccggctcgtg gctaagagat gtttgggatt ggatatgcac ggtgttgact 5340 
gatttcaaga cctggctcca gtccaagctc ctgccgcgat tgccgggagt ccccttcttc 5400 
tcatgtcaac gtgggtacaa gggagtctgg cggggcgacg gcatcatgca aaccacctgc 5460
ccatgtggag cacagatcac cggacatgtg aaaaacggtt ccatgaggat cgtggggcct 5520 
aggacctgta gtaacacgtg gcatggaaca ttccccatta acgcgtacac cacgggcccc 5580
tgcacgccct ccccggcgcc aaattattct agggcgctgt ggcgggtggc tgctgaggag 5640 
tacgtggagg ttacgcgggt gggggatttc cactacgtga cgggcatgac cactgacaac 5700 
gtaaagtgcc cgtgtcaggt tccggccccc gaattcttca cagaagtgga tggggtgcgg 5760 
ttgcacaggt acgctccagc gtgcaaaccc ctcctacggg aggaggtcac attcctggtc 5820 
gggctcaatc aatacctggt tgggtcacag ctcccatgcg agcccgaacc ggacgtagca 5880 
gtgctcactt ccatgctcac cgacccctcc cacattacgg cggagacggc taagcgtagg 5940 
ctggccaggg gatctccccc ctccttggcc agctcatcag ctagccagct gtctgcgcct 6000 
tccttgaagg caacatgcac tacccgtcat gactccccgg acgctgacct catcgaggcc 6060 
aacctcctgt ggcggcagga gatgggcggg aacatcaccc gcgtggagtc agaaaataag 6120 
gtagtaattt tggactcttt cgagccgctc caagcggagg aggatgagag ggaagtatcc 6180 
gttccggcgg agatcctgcg gaggtccagg aaattccctc gagcgatgcc catatgggca 6240 
cgcccggatt acaaccctcc actgttagag tcctggaagg acccggacta cgtccctcca 6300
gtggtacacg ggtgtccatt gccgcctgcc aaggcccctc cgataccacc tccacggagg 6360 
aagaggacgg ttgtcctgtc agaatctacc gtgtcttctg ccttggcgga gctcgccaca 6420
aagaccttcg gcagctccga atcgtcggcc gtcgacagcg gcacggcaac ggcctctcct 6480 
gaccagccct ccgacgacgg cgacgcggga tccgacgttg agtcgtactc ctccatgccc 6540 
ccccttgagg gggagccggg ggatcccgat ctcagcgacg ggtcttggtc taccgtaagc 6600 
gaggaggcta gtgaggacgt cgtctgctgc tcgatgtcct acacatggac aggcgccctg 6660 
atcacgccat gcgctgcgga ggaaaccaag ctgcccatca atgcactgag caactctttg 6720 
ctccgtcacc acaacttggt ctatgctaca acatctcgca gcgcaagcct gcggcagaag 6780 
aaggtcacct ttgacagact gcaggtcctg gacgaccact accgggacgt gctcaaggag 6840 
atgaaggcga aggcgtccac agttaaggct aaacttctat ccgtggagga agcctgtaag 6900 
ctgacgcccc cacattcggc cagatctaaa tttggctatg gggcaaagga cgtccggaac 6960 
ctatccagca aggccgttaa ccacatccgc tccgtgtgga aggacttgct ggaagacact 7020
gagacaccaa ttgacaccac catcatggca aaaaatgagg ttttctgcgt ccaaccagag 7080 
aaggggggcc gcaagccagc tcgccttatc gtattcccag atttgggggt tcgtgtgtgc 7140 
gagaaaatgg ccctttacga tgtggtctcc accctccctc aggccgtgat gggctcttca 7200 
tacggattcc aatactctcc tggacagcgg gtcgagttcc tggtgaatgc ctggaaagcg 7260 
aagaaatgcc ctatgggctt cgcatatgac acccgctgtt ttgactcaac ggtcactgag 7320 
aatgacatcc gtgttgagga gtcaatctac caatgttgtg acttggcccc cgaagccaga 7380 
caggccataa ggtcgctcac agagcggctt tacatcgggg gccccctgac taattctaaa 7440
gggcagaact gcggctatcg ccggtgccgc gcgagcggtg tactgacgac cagctgcggt 7500 
aataccctca catgttactt gaaggccgct gcggcctgtc gagctgcgaa gctccaggac 7560 
tgcacgatgc tcgtatgcgg agacgacctt gtcgttatct gtgaaagcgc ggggacccaa 7620 
gaggacgagg cgagcctacg ggccttcacg gaggctatga ctagatactc tgccccccct 7680
ggggacccgc ccaaaccaga atacgacttg gagttgataa catcatgctc ctccaatgtg 7740
tcagtcgcgc acgatgcatc tggcaaaagg gtgtactatc tcacccgtga ccccaccacc 7800
ccccttgcgc gggctgcgtg ggagacagct agacacactc cagtcaattc ctggctaggc 7860
aacatcatca tgtatgcgcc caccttgtgg gcaaggatga tcctgatgac tcatttcttc 7920
tccatccttc tagctcagga acaacttgaa aaagccctag attgtcagat ctacggggcc 7980
tgttactcca ttgagccact tgacctacct cagatcattc aacgactcca tggccttagc 8040
gcattttcac tccatagtta ctctccaggt gagatcaata gggtggcttc atgcctcagg 8100
aaacttgggg taccgccctt gcgagtctgg agacatcggg ccagaagtgt ccgcgctagg 8160
ctactgtccc agggggggag ggctgccact tgtggcaagt acctcttcaa ctgggcagta 8220
aggaccaagc tcaaactcac tccaatcccg gctgcgtccc agttggattt atccagctgg 8280
ttcgttgctg gttacagcgg gggagacata tatcacagcc tgtctcgtgc ccgaccccgc 8340
tggttcatgt ggtgcctact cctactttct gtaggggtag gcatctatct actccccaac 8400
cgatgaacgg ggagctaaac actccaggcc aataggccat cctgtttttt tccctttttt 8460
tttttctttt tttttttttt tttttttttt tttttttttt ctcctttttt tttcctcttt 8520
ttttcctttt ctttcctttg gtggctccat cttagcccta gtcacggcta gctgtgaaag 8580
gtccgtgagc cgcttgactg cagagagtgc tgatactggc ctctctgcag atcaagt 8637
<210> 3

<211> 8001 
<212> DNA 

<213> Hepatitis C Virus 
<400> 3

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360
ctcaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 420
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 480 
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 540 
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 600 
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 660 
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 720 
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 780 
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca tgtactcgga 840 
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 900 
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 960 
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 1020
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 1080 
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 1140 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 1200 
cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 1260 
aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 1320
tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 1380 
acgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 1440 
gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 1500 
tgcaggcagc ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta 1560 
taagatacac ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg 1620
gaaagagtca aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 1680
gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 1740 
tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1800
cacgataata ccatggcgcc tattacggcc tactcccaac agacgcgagg cctacttggc 1860
tgcatcatca ctagcctcac aggccgggac aggaaccagg tcgaggggga ggtccaagtg 1920 
gtctccaccg caacacaatc tttcctggcg acctgcgtca atggcgtgtg ttggactgtc 1980 
tatcatggtg ccggctcaaa gacccttgcc ggcccaaagg gcccaatcac ccaaatgtac 2040
accaatgtgg accaggacct cgtcggctgg caagcgcccc ccggggcgcg ttccttgaca 2100 
ccatgcacct gcggcagctc ggacctttac ttggtcacga ggcatgccga tgtcattccg 2160 
gtgcgccggc ggggcgacag cagggggagc ctactctccc ccaggcccgt ctcctacttg 2220
aagggctctt cgggcggtcc actgctctgc ccctcggggc acgctgtggg catctttcgg 2280 
gctgccgtgt gcacccgagg ggttgcgaag gcggtggact ttgtacccgt cgagtctatg 2340
gaaaccacta tgcggtcccc ggtcttcacg gacaactcgt cccctccggc cgtaccgcag 2400 
acattccagg tggcccatct acacgcccct actggtagcg gcaagagcac taaggtgccg 2460 
gctgcgtatg cagcccaagg gtataaggtg cttgtcctga acccgtccgt cgccgccacc 2520 
ctaggtttcg gggcgtatat gtctaaggca catggtatcg accctaacat cagaaccggg 2580 
gtaaggacca tcaccacggg tgcccccatc acgtactcca cctatggcaa gtttcttgcc 2640 
gacggtggtt gctctggggg cgcctatgac atcataatat gtgatgagtg ccactcaact 2700 
gactcgacca ctatcctggg catcggcaca gtcctggacc aagcggagac ggctggagcg 2760
cgactcgtcg tgctcgccac cgctacgcct ccgggatcgg tcaccgtgcc acatccaaac 2820 
atcgaggagg tggctctgtc cagcactgga gaaatcccct tttatggcaa agccatcccc 2880 
atcgagacca tcaagggggg gaggcacctc attttctgcc attccaagaa gaaatgtgat 2940 
gagctcgccg cgaagctgtc cggcctcgga ctcaatgctg tagcatatta ccggggcctt 3000 
gatgtatccg tcataccaac tagcggagac gtcattgtcg tagcaacgga cgctctaatg 3060 
acgggcttta ccggcgattt cgactcagtg atcgactgca atacatgtgt cacccagaca 3120 
gtcgacttca gcctggaccc gaccttcacc attgagacga cgaccgtgcc acaagacgcg 3180 
gtgtcacgct cgcagcggcg aggcaggact ggtaggggca ggatgggcat ttacaggttt 3240 
gtgactccag gagaacggcc ctcgggcatg ttcgattcct cggttctgtg cgagtgctat 3300 
gacgcgggct gtgcttggta cgagctcacg cccgccgaga cctcagttag gttgcgggct 3360 
tacctaaaca caccagggtt gcccgtctgc caggaccatc tggagttctg ggagagcgtc 3420 
tttacaggcc tcacccacat agacgcccat ttcttgtccc agactaagca ggcaggagac 3480 
aacttcccct acctggtagc ataccaggct acggtgtgcg ccagggctca ggctccacct 3540 
ccatcgtggg accaaatgtg gaagtgtctc atacggctaa agcctacgct gcacgggcca 3600
acgcccctgc tgtataggct gggagccgtt caaaacgagg ttactaccac acaccccata 3660 
accaaataca tcatggcatg catgtcggct gacctggagg tcgtcacgag cacctgggtg 3720 
ctggtaggcg gagtcctagc agctctggcc gcgtattgcc tgacaacagg cagcgtggtc 3780 
attgtgggca ggatcatctt gtccggaaag ccggccatca ttcccgacag ggaagtcctt 3840 
taccgggagt tcgatgagat ggaagagtgc gcctcacacc tcccttacat cgaacaggga 3900 
atgcagctcg ccgaacaatt caaacagaag gcaatcgggt tgctgcaaac agccaccaag 3960 
caagcggagg ctgctgctcc cgtggtggaa tccaagtggc ggaccctcga agccttctgg 4020 
gcgaagcata tgtggaattt catcagcggg atacaatatt tagcaggctt gtccactctg 4080 
cctggcaacc ccgcgatagc atcactgatg gcattcacag cctctatcac cagcccgctc 4140
accacccaac ataccctcct gtttaacatc ctggggggat gggtggccgc ccaacttgct 4200 
cctcccagcg ctgcttctgc tttcgtaggc gccggcatcg ctggagcggc tgttggcagc 4260 
ataggccttg ggaaggtgct tgtggatatt ttggcaggtt atggagcagg ggtggcaggc 4320 
gcgctcgtgg cctttaaggt catgagcggc gagatgccct ccaccgagga cctggttaac 4380 
ctactccctg ctatcctctc ccctggcgcc ctagtcgtcg gggtcgtgtg cgcagcgata 4440 
ctgcgtcggc acgtgggccc aggggagggg gctgtgcagt ggatgaaccg gctgatagcg 4500
ttcgcttcgc ggggtaacca cgtctccccc acgcactatg tgcctgagag cgacgctgca 4560 
gcacgtgtca ctcagatcct ctctagtctt accatcactc agctgctgaa gaggcttcac 4620
cagtggatca acgaggactg ctccacgcca tgctccggct cgtggctaag agatgtttgg 4680
gattggatat gcacggtgtt gactgatttc aagacctggc tccagtccaa gctcctgccg 4740 
cgattgccgg gagtcccctt cttctcatgt caacgtgggt acaagggagt ctggcggggc 4800 
gacggcatca tgcaaaccac ctgcccatgt ggagcacaga tcaccggaca tgtgaaaaac 4860
ggttccatga ggatcgtggg gcctaggacc tgtagtaaca cgtggcatgg aacattcccc 4920
attaacgcgt acaccacggg cccctgcacg ccctccccgg cgccaaatta ttctagggcg 4980
ctgtggcggg tggctgctga ggagtacgtg gaggttacgc gggtggggga tttccactac 5040 
gtgacgggca tgaccactga caacgtaaag tgcccgtgtc aggttccggc ccccgaattc 5100 
ttcacagaag tggatggggt gcggttgcac aggtacgctc cagcgtgcaa acccctccta 5160 
cgggaggagg tcacattcct ggtcgggctc aatcaatacc tggttgggtc acagctccca 5220 
tgcgagcccg aaccggacgt agcagtgctc acttccatgc tcaccgaccc ctcccacatt 5280 
acggcggaga cggctaagcg taggctggcc aggggatctc ccccctcctt ggccagctca 5340
tcagctagcc agctgtctgc gccttccttg aaggcaacat gcactacccg tcatgactcc 5400 
ccggacgctg acctcatcga ggccaacctc ctgtggcggc aggagatggg cgggaacatc 5460 
acccgcgtgg agtcagaaaa taaggtagta attttggact ctttcgagcc gctccaagcg 5520 
gaggaggatg agagggaagt atccgttccg gcggagatcc tgcggaggtc caggaaattc 5580 
cctcgagcga tgcccatatg ggcacgcccg gattacaacc ctccactgtt agagtcctgg 5640
aaggacccgg actacgtccc tccagtggta cacgggtgtc cattgccgcc tgccaaggcc 5700 
cctccgatac cacctccacg gaggaagagg acggttgtcc tgtcagaatc taccgtgtct 5760
tctgccttgg cggagctcgc cacaaagacc ttcggcagct ccgaatcgtc ggccgtcgac 5820 
agcggcacgg caacggcctc tcctgaccag ccctccgacg acggcgacgc gggatccgac 5880 
gttgagtcgt actcctccat gccccccctt gagggggagc cgggggatcc cgatctcagc 5940
gacgggtctt ggtctaccgt aagcgaggag gctagtgagg acgtcgtctg ctgctcgatg 6000
tcctacacat ggacaggcgc cctgatcacg ccatgcgctg cggaggaaac caagctgccc 6060 
atcaatgcac tgagcaactc tttgctccgt caccacaact tggtctatgc tacaacatct 6120 
cgcagcgcaa gcctgcggca gaagaaggtc acctttgaca gactgcaggt cctggacgac 6180 
cactaccggg acgtgctcaa ggagatgaag gcgaaggcgt ccacagttaa ggctaaactt 6240 
ctatccgtgg aggaagcctg taagctgacg cccccacatt cggccagatc taaatttggc 6300 
tatggggcaa aggacgtccg gaacctatcc agcaaggccg ttaaccacat ccgctccgtg 6360
tggaaggact tgctggaaga cactgagaca ccaattgaca ccaccatcat ggcaaaaaat 6420 
gaggttttct gcgtccaacc agagaagggg ggccgcaagc cagctcgcct tatcgtattc 6480 
ccagatttgg gggttcgtgt gtgcgagaaa atggcccttt acgatgtggt ctccaccctc 6540 
cctcaggccg tgatgggctc ttcatacgga ttccaatact ctcctggaca gcgggtcgag 6600 
ttcctggtga atgcctggaa agcgaagaaa tgccctatgg gcttcgcata tgacacccgc 6660 
tgttttgact caacggtcac tgagaatgac atccgtgttg aggagtcaat ctaccaatgt 6720 
tgtgacttgg cccccgaagc cagacaggcc ataaggtcgc tcacagagcg gctttacatc 6780 
gggggccccc tgactaattc taaagggcag aactgcggct atcgccggtg ccgcgcgagc 6840
ggtgtactga cgaccagctg cggtaatacc ctcacatgtt acttgaaggc cgctgcggcc 6900 
tgtcgagctg cgaagctcca ggactgcacg atgctcgtat gcggagacga ccttgtcgtt 6960 
atctgtgaaa gcgcggggac ccaagaggac gaggcgagcc tacgggcctt cacggaggct 7020 
atgactagat actctgcccc ccctggggac ccgcccaaac cagaatacga cttggagttg 7080 
ataacatcat gctcctccaa tgtgtcagtc gcgcacgatg catctggcaa aagggtgtac 7140 
tatctcaccc gtgaccccac cacccccctt gcgcgggctg cgtgggagac agctagacac 7200 
actccagtca attcctggct aggcaacatc atcatgtatg cgcccacctt gtgggcaagg 7260 
atgatcctga tgactcattt cttctccatc cttctagctc aggaacaact tgaaaaagcc 7320 
ctagattgtc agatctacgg ggcctgttac tccattgagc cacttgacct acctcagatc 7380 
attcaacgac tccatggcct tagcgcattt tcactccata gttactctcc aggtgagatc 7440 
aatagggtgg cttcatgcct caggaaactt ggggtaccgc ccttgcgagt ctggagacat 7500 
cgggccagaa gtgtccgcgc taggctactg tcccaggggg ggagggctgc cacttgtggc 7560 
aagtacctct tcaactgggc agtaaggacc aagctcaaac tcactccaat cccggctgcg 7620
tcccagttgg atttatccag ctggttcgtt gctggttaca gcgggggaga catatatcac 7680 
agcctgtctc gtgcccgacc ccgctggttc atgtggtgcc tactcctact ttctgtaggg 7740 
gtaggcatct atctactccc caaccgatga acggggagct aaacactcca ggccaatagg 7800
ccatcctgtt tttttccctt tttttttttc tttttttttt tttttttttt tttttttttt 7860 
ttttctcctt tttttttcct ctttttttcc ttttctttcc tttggtggct ccatcttagc 7920
cctagtcacg gctagctgtg aaaggtccgt gagccgcttg actgcagaga gtgctgatac 7980
tggcctctct gcagatcaag t 8001 



CA 02303526 2000-03-31



<210> 4 
<211> 7989 
<212> DNA 

<213> Hepatitis C Virus 
<400> 4

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420
cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 480 
ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540
acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600 
cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 
tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720
aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780 
cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 
ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 
ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 960 
gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020
tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 
ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 
agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 
gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260
cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 
ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380
aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440
gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500
aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560
gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620
tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 
atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740
aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1800 
atggcgccta ttacggccta ctcccaacag acgcgaggcc tacttggctg catcatcact 1860 
agcctcacag gccgggacag gaaccaggtc gagggggagg tccaagtggt ctccaccgca 1920 
acacaatctt tcctggcgac ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc 1980
ggctcaaaga cccttgccgg cccaaagggc ccaatcaccc aaatgtacac caatgtggac 2040 
caggacctcg tcggctggca agcgcccccc ggggcgcgtt ccttgacacc atgcacctgc 2100
ggcagctcgg acctttactt ggtcacgagg catgccgatg tcattccggt gcgccggcgg 2160
ggcgacagca gggggagcct actctccccc aggcccgtct cctacttgaa gggctcttcg 2220 
ggcggtccac tgctctgccc ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc 2280 
acccgagggg ttgcgaaggc ggtggacttt gtacccgtcg agtctatgga aaccactatg 2340
cggtccccgg tcttcacgga caactcgtcc cctccggccg taccgcagac attccaggtg 2400
gcccatctac acgcccctac tggtagcggc aagagcacta aggtgccggc tgcgtatgca 2460
gcccaagggt ataaggtgct tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg 2520 
gcgtatatgt ctaaggcaca tggtatcgac cctaacatca gaaccggggt aaggaccatc 2580
accacgggtg cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 2640 
tctgggggcg cctatgacat cataatatgt gatgagtgcc actcaactga ctcgaccact 2700 
atcctgggca tcggcacagt cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg 2760 
ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 2820 
gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccccat cgagaccatc 2880 
aaggggggga ggcacctcat tttctgccat tccaagaaga aatgtgatga gctcgccgcg 2940 
aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3000
ataccaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3060
ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3120 
ctggacccga ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3180 
cagcggcgag gcaggactgg taggggcagg atgggcattt acaggtttgt gactccagga 3240 
gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3300 
gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 3360 
ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 3420 
acccacatag acgcccattt cttgtcccag actaagcagg caggagacaa cttcccctac 3480 
ctggtagcat accaggctac ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac 3540 
caaatgtgga agtgtctcat acggctaaag cctacgctgc acgggccaac gcccctgctg 3600 
tataggctgg gagccgttca aaacgaggtt actaccacac accccataac caaatacatc 3660
atggcatgca tgtcggctga cctggaggtc gtcacgagca cctgggtgct ggtaggcgga 3720 
gtcctagcag ctctggccgc gtattgcctg acaacaggca gcgtggtcat tgtgggcagg 3780 
atcatcttgt ccggaaagcc ggccatcatt cccgacaggg aagtccttta ccgggagttc 3840 
gatgagatgg aagagtgcgc ctcacacctc ccttacatcg aacagggaat gcagctcgcc 3900 
gaacaattca aacagaaggc aatcgggttg ctgcaaacag ccaccaagca agcggaggct 3960 
gctgctcccg tggtggaatc caagtggcgg accctcgaag ccttctgggc gaagcatatg 4020 
tggaatttca tcagcgggat acaatattta gcaggcttgt ccactctgcc tggcaacccc 4080
gcgatagcat cactgatggc attcacagcc tctatcacca gcccgctcac cacccaacat 4140
accctcctgt ttaacatcct ggggggatgg gtggccgccc aacttgctcc tcccagcgct 4200 
gcttctgctt tcgtaggcgc cggcatcgct ggagcggctg ttggcagcat aggccttggg 4260
aaggtgcttg tggatatttt ggcaggttat ggagcagggg tggcaggcgc gctcgtggcc 4320
tttaaggtca tgagcggcga gatgccctcc accgaggacc tggttaacct actccctgct 4380
atcctctccc ctggcgccct agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac 4440
gtgggcccag gggagggggc tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg 4500
ggtaaccacg tctcccccac gcactatgtg cctgagagcg acgctgcagc acgtgtcact 4560
cagatcctct ctagtcttac catcactcag ctgctgaaga ggcttcacca gtggatcaac 4620
gaggactgct ccacgccatg ctccggctcg tggctaagag atgtttggga ttggatatgc 4680
acggtgttga ctgatttcaa gacctggctc cagtccaagc tcctgccgcg attgccggga 4740
gtccccttct tctcatgtca acgtgggtac aagggagtct ggcggggcga cggcatcatg 4800
caaaccacct gcccatgtgg agcacagatc accggacatg tgaaaaacgg ttccatgagg 4860
atcgtggggc ctaggacctg tagtaacacg tggcatggaa cattccccat taacgcgtac 4920
accacgggcc cctgcacgcc ctccccggcg ccaaattatt ctagggcgct gtggcgggtg 4980
gctgctgagg agtacgtgga ggttacgcgg gtgggggatt tccactacgt gacgggcatg 5040
accactgaca acgtaaagtg cccgtgtcag gttccggccc ccgaattctt cacagaagtg 5100
gatggggtgc ggttgcacag gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc 5160 
acattcctgg tcgggctcaa tcaatacctg gttgggtcac agctcccatg cgagcccgaa 5220
ccggacgtag cagtgctcac ttccatgctc accgacccct cccacattac ggcggagacg 5280
gctaagcgta ggctggccag gggatctccc ccctccttgg ccagctcatc agctagccag 5340
ctgtctgcgc cttccttgaa ggcaacatgc actacccgtc atgactcccc ggacgctgac 5400
ctcatcgagg ccaacctcct gtggcggcag gagatgggcg ggaacatcac ccgcgtggag 5460
tcagaaaata aggtagtaat tttggactct ttcgagccgc tccaagcgga ggaggatgag 5520
agggaagtat ccgttccggc ggagatcctg cggaggtcca ggaaattccc tcgagcgatg 5580
cccatatggg cacgcccgga ttacaaccct ccactgttag agtcctggaa ggacccggac 5640
tacgtccctc cagtggtaca cgggtgtcca ttgccgcctg ccaaggcccc tccgatacca 5700 
cctccacgga ggaagaggac ggttgtcctg tcagaatcta ccgtgtcttc tgccttggcg 5760
gagctcgcca caaagacctt cggcagctcc gaatcgtcgg ccgtcgacag cggcacggca 5820
acggcctctc ctgaccagcc ctccgacgac ggcgacgcgg gatccgacgt tgagtcgtac 5880
tcctccatgc ccccccttga gggggagccg ggggatcccg atctcagcga cgggtcttgg 5940
tctaccgtaa gcgaggaggc tagtgaggac gtcgtctgct gctcgatgtc ctacacatgg 6000
acaggcgccc tgatcacgcc atgcgctgcg gaggaaacca agctgcccat caatgcactg 6060
agcaactctt tgctccgtca ccacaacttg gtctatgcta caacatctcg cagcgcaagc 6120
ctgcggcaga agaaggtcac ctttgacaga ctgcaggtcc tggacgacca ctaccgggac 6180
gtgctcaagg agatgaaggc gaaggcgtcc acagttaagg ctaaacttct atccgtggag 6240
gaagcctgta agctgacgcc cccacattcg gccagatcta aatttggcta tggggcaaag 6300
gacgtccgga acctatccag caaggccgtt aaccacatcc gctccgtgtg gaaggacttg 6360
ctggaagaca ctgagacacc aattgacacc accatcatgg caaaaaatga ggttttctgc 6420
gtccaaccag agaagggggg ccgcaagcca gctcgcctta tcgtattccc agatttgggg 6480 
gttcgtgtgt gcgagaaaat ggccctttac gatgtggtct ccaccctccc tcaggccgtg 6540
atgggctctt catacggatt ccaatactct cctggacagc gggtcgagtt cctggtgaat 6600
gcctggaaag cgaagaaatg ccctatgggc ttcgcatatg acacccgctg ttttgactca 6660 
acggtcactg agaatgacat ccgtgttgag gagtcaatct accaatgttg tgacttggcc 6720
cccgaagcca gacaggccat aaggtcgctc acagagcggc tttacatcgg gggccccctg 6780
actaattcta aagggcagaa ctgcggctat cgccggtgcc gcgcgagcgg tgtactgacg 6840
accagctgcg gtaataccct cacatgttac ttgaaggccg ctgcggcctg tcgagctgcg 6900
aagctccagg actgcacgat gctcgtatgc ggagacgacc ttgtcgttat ctgtgaaagc 6960
gcggggaccc aagaggacga ggcgagccta cgggccttca cggaggctat gactagatac 7020
tctgcccccc ctggggaccc gcccaaacca gaatacgact tggagttgat aacatcatgc 7080 
tcctccaatg tgtcagtcgc gcacgatgca tctggcaaaa gggtgtacta tctcacccgt 7140
gaccccacca ccccccttgc gcgggctgcg tgggagacag ctagacacac tccagtcaat 7200 
tcctggctag gcaacatcat catgtatgcg cccaccttgt gggcaaggat gatcctgatg 7260 
actcatttct tctccatcct tctagctcag gaacaacttg aaaaagccct agattgtcag 7320
atctacgggg cctgttactc cattgagcca cttgacctac ctcagatcat tcaacgactc 7380
catggcctta gcgcattttc actccatagt tactctccag gtgagatcaa tagggtggct 7440
tcatgcctca ggaaacttgg ggtaccgccc ttgcgagtct ggagacatcg ggccagaagt 7500
gtccgcgcta ggctactgtc ccaggggggg agggctgcca cttgtggcaa gtacctcttc 7560
aactgggcag taaggaccaa gctcaaactc actccaatcc cggctgcgtc ccagttggat 7620 



ttatccagct ggttcgttgc tggttacagc gggggagaca tatatcacag cctgtctcgt 7680
gcccgacccc gctggttcat gtggtgccta ctcctacttt ctgtaggggt aggcatctat 7740
ctactcccca accgatgaac ggggagctaa acactccagg ccaataggcc atcctgtttt 7800
tttccctttt tttttttctt tttttttttt tttttttttt tttttttttt ttctcctttt 7860
tttttcctct ttttttcctt ttctttcctt tggtggctcc atcttagccc tagtcacggc 7920
tagctgtgaa aggtccgtga gccgcttgac tgcagagagt gctgatactg gcctctctgc 7980
agatcaagt 7989
<210> 5
<211> 8649 
<212> DNA 

<213> Hepatitis C Virus 
<400> 5 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 420 
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 480 
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 540 
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 600 
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 660 
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 720 
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 780 
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 840 
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 900
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 960 
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 1020 
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 1080 
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 1140 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 1200 
cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 1260 
aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 1320 
tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 1380 
acgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 1440
gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 1500
tgcaggcagc ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta 1560
taagatacac ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg 1620 
gaaagagtca aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 1680 
gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 1740 
tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1800 
cacgataata ccatggaccg ggagatggca gcatcgtgcg gaggcgcggt tttcgtaggt 1860 
ctgatactct tgaccttgtc accgcactat aagctgttcc tcgctaggct catatggtgg 1920 
ttacaatatt ttatcaccag ggccgaggca cacttgcaag tgtggatccc ccccctcaac 1980 
gttcgggggg gccgcgatgc cgtcatcctc ctcacgtgcg cgatccaccc agagctaatc 2040
tttaccatca ccaaaatctt gctcgccata ctcggtccac tcatggtgct ccaggctggt 2100
ataaccaaag tgccgtactt cgtgcgcgca cacgggctca ttcgtgcatg catgctggtg 2160
cggaaggttg ctgggggtca ttatgtccaa atggctctca tgaagttggc cgcactgaca 2220
ggtacgtacg tttatgacca tctcacccca ctgcgggact gggcccacgc gggcctacga 2280 
gaccttgcgg tggcagttga gcccgtcgtc ttctctgata tggagaccaa ggttatcacc 2340 
tggggggcag acaccgcggc gtgtggggac atcatcttgg gcctgcccgt ctccgcccgc 2400
agggggaggg agatacatct gggaccggca gacagccttg aagggcaggg gtggcgactc 2460
ctcgcgccta ttacggccta ctcccaacag acgcgaggcc tacttggctg catcatcact 2520 
agcctcacag gccgggacag gaaccaggtc gagggggagg tccaagtggt ctccaccgca 2580 
acacaatctt tcctggcgac ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc 2640 
ggctcaaaga cccttgccgg cccaaagggc ccaatcaccc aaatgtacac caatgtggac 2700 
caggacctcg tcggctggca agcgcccccc ggggcgcgtt ccttgacacc atgcacctgc 2760 
ggcagctcgg acctttactt ggtcacgagg catgccgatg tcattccggt gcgccggcgg 2820 
ggcgacagca gggggagcct actctccccc aggcccgtct cctacttgaa gggctcttcg 2880 
ggcggtccac tgctctgccc ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc 2940 
acccgagggg ttgcgaaggc ggtggacttt gtacccgtcg agtctatgga aaccactatg 3000 
cggtccccgg tcttcacgga caactcgtcc cctccggccg taccgcagac attccaggtg 3060
gcccatctac acgcccctac tggtagcggc aagagcacta aggtgccggc tgcgtatgca 3120 
gcccaagggt ataaggtgct tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg 3180 
gcgtatatgt ctaaggcaca tggtatcgac cctaacatca gaaccggggt aaggaccatc 3240 
accacgggtg cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 3300 
tctgggggcg cctatgacat cataatatgt gatgagtgcc actcaactga ctcgaccact 3360 
atcctgggca tcggcacagt cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg 3420 
ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 3480 
gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccccat cgagaccatc 3540 
aaggggggga ggcacctcat tttctgccat tccaagaaga aatgtgatga gctcgccgcg 3600 
aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3660
ataccaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3720 
ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3780 
ctggacccga ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3840
cagcggcgag gcaggactgg taggggcagg atgggcattt acaggtttgt gactccagga 3900 
gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3960 
gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 4020 
ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 4080 
acccacatag acgcccattt cttgtcccag actaagcagg caggagacaa cttcccctac 4140
ctggtagcat accaggctac ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac 4200
caaatgtgga agtgtctcat acggctaaag cctacgctgc acgggccaac gcccctgctg 4260
tataggctgg gagccgttca aaacgaggtt actaccacac accccataac caaatacatc 4320
atggcatgca tgtcggctga cctggaggtc gtcacgagca cctgggtgct ggtaggcgga 4380
gtcctagcag ctctggccgc gtattgcctg acaacaggca gcgtggtcat tgtgggcagg 4440
atcatcttgt ccggaaagcc ggccatcatt cccgacaggg aagtccttta ccgggagttc 4500 
gatgagatgg aagagtgcgc ctcacacctc ccttacatcg aacagggaat gcagctcgcc 4560 
gaacaattca aacagaaggc aatcgggttg ctgcaaacag ccaccaagca agcggaggct 4620 
gctgctcccg tggtggaatc caagtggcgg accctcgaag ccttctgggc gaagcatatg 4680
tggaatttca tcagcgggat acaatattta gcaggcttgt ccactctgcc tggcaacccc 4740 
gcgatagcat cactgatggc attcacagcc tctatcacca gcccgctcac cacccaacat 4800 
accctcctgt ttaacatcct ggggggatgg gtggccgccc aacttgctcc tcccagcgct 4860
gcttctgctt tcgtaggcgc cggcatcgct ggagcggctg ttggcagcat aggccttggg 4920 
aaggtgcttg tggatatttt ggcaggttat ggagcagggg tggcaggcgc gctcgtggcc 4980
tttaaggtca tgagcggcga gatgccctcc accgaggacc tggttaacct actccctgct 5040 
atcctctccc ctggcgccct agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac 5100 
gtgggcccag gggagggggc tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg 5160 
ggtaaccacg tctcccccac gcactatgtg cctgagagcg acgctgcagc acgtgtcact 5220 
cagatcctct ctagtcttac catcactcag ctgctgaaga ggcttcacca gtggatcaac 5280 
gaggactgct ccacgccatg ctccggctcg tggctaagag atgtttggga ttggatatgc 5340 
acggtgttga ctgatttcaa gacctggctc cagtccaagc tcctgccgcg attgccggga 5400
gtccccttct tctcatgtca acgtgggtac aagggagtct ggcggggcga cggcatcatg 5460
caaaccacct gcccatgtgg agcacagatc accggacatg tgaaaaacgg ttccatgagg 5520
atcgtggggc ctaggacctg tagtaacacg tggcatggaa cattccccat taacgcgtac 5580
accacgggcc cctgcacgcc ctccccggcg ccaaattatt ctagggcgct gtggcgggtg 5640
gctgctgagg agtacgtgga ggttacgcgg gtgggggatt tccactacgt gacgggcatg 5700 
accactgaca acgtaaagtg cccgtgtcag gttccggccc ccgaattctt cacagaagtg 5760 
gatggggtgc ggttgcacag gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc 5820 
acattcctgg tcgggctcaa tcaatacctg gttgggtcac agctcccatg cgagcccgaa 5880 
ccggacgtag cagtgctcac ttccatgctc accgacccct cccacattac ggcggagacg 5940
gctaagcgta ggctggccag gggatctccc ccctccttgg ccagctcatc agctagccag 6000 
ctgtctgcgc cttccttgaa ggcaacatgc actacccgtc atgactcccc ggacgctgac 6060 
ctcatcgagg ccaacctcct gtggcggcag gagatgggcg ggaacatcac ccgcgtggag 6120 
tcagaaaata aggtagtaat tttggactct ttcgagccgc tccaagcgga ggaggatgag 6180 
agggaagtat ccgttccggc ggagatcctg cggaggtcca ggaaattccc tcgagcgatg 6240 
cccatatggg cacgcccgga ttacaaccct ccactgttag agtcctggaa ggacccggac 6300 
tacgtccctc cagtggtaca cgggtgtcca ttgccgcctg ccaaggcccc tccgatacca 6360
cctccacgga ggaagaggac ggttgtcctg tcagaatcta ccgtgtcttc tgccttggcg 6420 
gagctcgcca caaagacctt cggcagctcc gaatcgtcgg ccgtcgacag cggcacggca 6480 
acggcctctc ctgaccagcc ctccgacgac ggcgacgcgg gatccgacgt tgagtcgtac 6540
tcctccatgc ccccccttga gggggagccg ggggatcccg atctcagcga cgggtcttgg 6600 
tctaccgtaa gcgaggaggc tagtgaggac gtcgtctgct gctcgatgtc ctacacatgg 6660 
acaggcgccc tgatcacgcc atgcgctgcg gaggaaacca agctgcccat caatgcactg 6720 
agcaactctt tgctccgtca ccacaacttg gtctatgcta caacatctcg cagcgcaagc 6780
ctgcggcaga agaaggtcac ctttgacaga ctgcaggtcc tggacgacca ctaccgggac 6840
gtgctcaagg agatgaaggc gaaggcgtcc acagttaagg ctaaacttct atccgtggag 6900 
gaagcctgta agctgacgcc cccacattcg gccagatcta aatttggcta tggggcaaag 6960 
gacgtccgga acctatccag caaggccgtt aaccacatcc gctccgtgtg gaaggacttg 7020 
ctggaagaca ctgagacacc aattgacacc accatcatgg caaaaaatga ggttttctgc 7080 
gtccaaccag agaagggggg ccgcaagcca gctcgcctta tcgtattccc agatttgggg 7140 
gttcgtgtgt gcgagaaaat ggccctttac gatgtggtct ccaccctccc tcaggccgtg 7200 
atgggctctt catacggatt ccaatactct cctggacagc gggtcgagtt cctggtgaat 7260 
gcctggaaag cgaagaaatg ccctatgggc ttcgcatatg acacccgctg ttttgactca 7320 
acggtcactg agaatgacat ccgtgttgag gagtcaatct accaatgttg tgacttggcc 7380 
cccgaagcca gacaggccat aaggtcgctc acagagcggc tttacatcgg gggccccctg 7440 
actaattcta aagggcagaa ctgcggctat cgccggtgcc gcgcgagcgg tgtactgacg 7500 
accagctgcg gtaataccct cacatgttac ttgaaggccg ctgcggcctg tcgagctgcg 7560 
aagctccagg actgcacgat gctcgtatgc ggagacgacc ttgtcgttat ctgtgaaagc 7620 
gcggggaccc aagaggacga ggcgagccta cgggccttca cggaggctat gactagatac 7680
tctgcccccc ctggggaccc gcccaaacca gaatacgact tggagttgat aacatcatgc 7740 
tcctccaatg tgtcagtcgc gcacgatgca tctggcaaaa gggtgtacta tctcacccgt 7800 
gaccccacca ccccccttgc gcgggctgcg tgggagacag ctagacacac tccagtcaat 7860
tcctggctag gcaacatcat catgtatgcg cccaccttgt gggcaaggat gatcctgatg 7920 
actcatttct tctccatcct tctagctcag gaacaacttg aaaaagccct agattgtcag 7980 
atctacgggg cctgttactc cattgagcca cttgacctac ctcagatcat tcaacgactc 8040 
catggcctta gcgcattttc actccatagt tactctccag gtgagatcaa tagggtggct 8100 
tcatgcctca ggaaacttgg ggtaccgccc ttgcgagtct ggagacatcg ggccagaagt 8160 
gtccgcgcta ggctactgtc ccaggggggg agggctgcca cttgtggcaa gtacctcttc 8220 
aactgggcag taaggaccaa gctcaaactc actccaatcc cggctgcgtc ccagttggat 8280 
ttatccagct ggttcgttgc tggttacagc gggggagaca tatatcacag cctgtctcgt 8340 
gcccgacccc gctggttcat gtggtgccta ctcctacttt ctgtaggggt aggcatctat 8400
ctactcccca accgatgaac ggggagctaa acactccagg ccaataggcc atcctgtttt 8460 
tttccctttt tttttttctt tttttttttt tttttttttt tttttttttt ttctcctttt 8520
tttttcctct ttttttcctt ttctttcctt tggtggctcc atcttagccc tagtcacggc 8580 
tagctgtgaa aggtccgtga gccgcttgac tgcagagagt gctgatactg gcctctctgc 8640
agatcaagt 8649
<210> 6
<211> 8001 
<212> DNA 

<213> Hepatitis C Virus
<400> 6

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 420 
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 480 
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 540 
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 600
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 660
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 720 
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 780 
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 840 
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 900
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 960 
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 1020 
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 1080 
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 1140 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 1200
cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 1260 
aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 1320 
tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 1380 
acgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 1440
gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 1500 
tgcaggcagc ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta 1560
taagatacac ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg 1620
gaaagagtca aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 1680
gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 1740 
tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1800 
cacgataata ccatggcgcc tattacggcc tactcccaac agacgcgagg cctacttggc 1860 
tgcatcatca ctagcctcac aggccgggac aggaaccagg tcgaggggga ggtccaagtg 1920 
gtctccaccg caacacaatc tttcctggcg acctgcgtca atggcgtgtg ttggactgtc 1980 
tatcatggtg ccggctcaaa gacccttgcc ggcccaaagg gcccaatcac ccaaatgtac 2040 
accaatgtgg accaggacct cgtcggctgg caagcgcccc ccggggcgcg ttccttgaca 2100
ccatgcacct gcggcagctc ggacctttac ttggtcacga ggcatgccga tgtcattccg 2160 
gtgcgccggc ggggcgacag cagggggagc ctactctccc ccaggcccgt ctcctacttg 2220 
aagggctctt cgggcggtcc actgctctgc ccctcggggc atgctgtggg catctttcgg 2280 
gctgccgtgt gcacccgagg ggttgcgaag gcggtggact ttgtacccgt cgagtctatg 2340
gaaaccacta tgcggtcccc ggtcttcacg gacaactcgt cccctccggc cgtaccgcag 2400 
acattccagg tggcccatct acacgcccct actggtagcg gcaagagcac taaggtgccg 2460
gctgcgtatg cagcccaagg gtataaggtg cttgtcctga acccgtccgt cgccgccacc 2520 
ctaggtttcg gggcgtatat gtctaaggca catggtatcg accctaacat cagaaccggg 2580 
gtagggacca tcaccacggg tgcccccatc acgtactcca cctatggcaa gtttcttgcc 2640
gacggtggtt gctctggggg cgcctatgac atcataatat gtgatgagtg ccactcaact 2700 
gactcgacca ctatcctggg catcggcaca gtcctggacc aagcggagac ggctggagcg 2760 
cgactcgtcg tgctcgccac cgctacgcct ccgggatcgg tcaccgtgcc acatccaaac 2820 
atcgaggagg tggctctgtc cagcactgga gaaatcccct tttatggcaa agccatcccc 2880 
atcgcgacca tcaagggggg gaggcacctc attttctgcc attccaagaa gaaatgtgat 2940
gagctcgccg cgaagctatc cggcctcgga ctcaatgctg tagcatatta ccggggcctt 3000 
gatgtatccg tcataccaac tagcggagac gtcattgtcg tagcaacgga cgctctaatg 3060 
acgggcttta ccggcgattt cgactcagtg atcgactgca atacatgtgt cacccagaca 3120 
gtcgacttca gcctggaccc gaccttcacc attgagacga cgaccgtgcc acaagacgcg 3180 
gtgtcacgct cgcagcggcg aggcaggact ggtaggggca ggatgggcat ttacaggttt 3240
gtgactccag gagaacggcc ctcgggcatg ttcgattcct cggttctgtg cgagtgctat 3300 
gacgcgggct gtgcttggta cgagctcacg cccgccgaga cctcagttag gttgcgggct 3360 
tacctaaaca caccagggct gcccgtctgc caggaccatc tggagttctg ggagagcgtc 3420 
tttacaggcc tcacccacat agacgcccat ttcttgtccc agactaggca ggcaggagac 3480 
aacttcccct acctggtagc ataccaggct acggtgtgcg ccagggctca ggctccacct 3540 
ccatcgtggg accaaatgtg ggagtgtctc atacggctaa agcctacgct gcacgggcca 3600 
acgcccctgc tgtataggct gggagccgtt caaaacgagg ttactaccac acaccccata 3660
accaaataca tcatggcatg catgtcggct gacctggagg tcgtcacgag cacctgggtg 3720 
ctggtaggcg gagtcctagc agctctggcc gcgtattgcc tgacaacagg cagcgtggtc 3780
attgtgggca ggatcatctt gtccggaaag ccggccatca ttcccgacag ggaagtcctt 3840 
taccgggagt tcgatgagat ggaagagtgc gcctcacacc tcccttacat cgaacaggga 3900 
atgcagctcg ccgaacaatt caaacagaag gcaatcgggt tgctgcaaac agccaccaag 3960 
caagcggagg ctgctgctcc cgtggtggaa tccaagtggc ggaccctcga agccttctgg 4020 
gcgaagcata tgtggaattt catcagcggg atacaatatt tagcaggctt gtccactctg 4080
cctggcaacc ccgcgatagc atcactgatg gcattcacag cctctatcac cagcccgctc 4140 
accacccaac ataccctcct gtttaacatc ctggggggat gggtggccgc ccaacttgct 4200
cctcccagcg ctgcttctgc tttcgtaggc gccggcatcg ctggagcggc tgttggcagc 4260
ataggccttg ggaaggtgct tgtggatatt ttggcaggtt atggagcagg ggtggcaggc 4320
gcgctcgtgg cctttaaggt catgagcggc gagatgccct ccaccgagga cctggttaac 4380 
ctactccctg ctatcctctc ccctggcgcc ctagtcgtcg gggtcgtgtg cgcagcgata 4440 
ctgcgtcggc acgtgggccc aggggagggg gctgtgcagt ggatgaaccg gctgatagcg 4500
ttcgcttcgc ggggtaacca cgtctccccc acgcactatg tgtctgagag cgacgctgta 4560
gcacgtgtca ctcagatcct ctctagtctt accatcactc agctgctgaa gaggcttcac 4620
cagtggatca acgaggactg ctccacgcca tgctccggct cgtggctaag agatgtttgg 4680 
gattggatat gcacggtgtt gactgatttc aagacctggc tccagtccaa gctcctgccg 4740 
cgattgccgg gagtcccctt cttctcatgt caacgtgggt acaagggagt ctggcggggc 4800 
gacggcatca tgcaaaccac ctgcccatgt ggagcacaga tcaccggaca tgtgaaaaac 4860 
ggttccatga ggatcgtggg gcctaggacc tgtagtaaca cgtggcatgg aacattcccc 4920
attaacgcgt acaccacggg cccctgcacg ccctccccgg cgccaaatta ttctagggcg 4980
ctgtggcggg tggctgctga ggagtacgtg gaggttacgc gggtggggga tttccactac 5040 
gtgacgggca tgaccactga caacgtaaag tgcccgtgtc aggttccggc ccccgaattc 5100 
ttcacagaag tggatggggt gcggttgcac aggtacgctc cagcgtgcaa acccctccta 5160 
cgggaggagg tcacattcct ggtcgggctc aatcaatacc tggttgggtc acagctccca 5220
tgcgggcccg aaccggacgt agcagtgctc acttccatgc tcaccgaccc ctcccacatt 5280 
acggcggaga cggctaagcg taggctggcc aggggatctc ccccctcctt ggccagctca 5340 
tcagctagcc agctgtctgc gccttccttg aaggcaacat gcactacccg tcatgactcc 5400
ccggacgctg acctcatcga ggccaacctc ctgtggcggc aggagatggg cgggaacatc 5460
acccgcgtgg agtcagaaaa taaggtagta attttggact ctttcgagcc gctccaagcg 5520 
gaggaggatg agagggaagt atccgttccg gcggagatcc tgcggaggtc caggaaattc 5580 
cctcgagcga tgcccatatg ggcacgcccg gattacaacc ctccactgtt agagtcctgg 5640 
aaggacccgg actacgtccc tccagtggta cacgggtgtc cattgccgcc tgccaaggcc 5700 
cctccgatac cacctccacg gagggagagg acggttgtcc tgtcagaatc taccgtgtct 5760 
tctgccttgg cggagctcgc cacaaagacc ttcggcagct ccgaatcgtc ggccgtcgac 5820
agcggcacgg caacggcctc tcctgaccag ccctccgacg acggcgacgc gggatccgac 5880 
gttgagtcgt actcctccat gccccccctt gagggggagc cgggggattc cgatctcagc 5940 
gacgggtctt ggtctaccgt aagcgaggag gctagtgagg acgtcgtctg ctgctcgatg 6000 
tcctacacat ggacaggcgc cctgatcacg ccatgcgctg cggaggaaac caagctgccc 6060 
gtcaatgcac tgagcaactc tttgctccgt caccacaact tggtctatgc tacaacatct 6120
cgcagcgcaa gcctgcggca gaagaaggtc acctttgaca gactgcaggt cctggacgac 6180 
cactaccggg acgtgctcaa ggagatgaag gcgaaggcgt ccacagttaa ggctaaactt 6240
ctatccgtgg aggaagcctg taagctgacg cccccacatt cggccagatc taaatttggc 6300
tatggggcaa aggacgtccg gaacctatcc agcaaggccg ttaaccacat ccgctccgtg 6360
tggaaggact tgctggaaga cactgagaca ccaattgaca ccaccatcat ggcaaaaaat 6420 
gaggttttct gcgtccaacc agagaagggg ggccgcaagc cagctcgcct tatcgtattc 6480
ccagatttgg gggttcgtgt gtgcgagaaa atggcccttt acgatgtggt ctccaccctc 6540 
cctcaggccg tgatgggctc ttcatacgga ttccaatact ctcctggaca gcgggtcgag 6600 
ttcctggtga atgcctggaa agcgaagaaa tgccctatgg gcttcgcata tgacacccgc 6660 
tgttttgact caacggtcac tgagaatgac atccgtgttg aggagtcaat ctaccaatgt 6720 
tgtgacttgg cccccgaagc cagacaggcc ataaggtcgc tcacagagcg gctttacatc 6780 
gggggccccc tgactaattc taaagggcag aactgcggct atcgccggtg ccgcgcgagc 6840 
ggtgtactga cgaccagctg cggtaatacc ctcacatgtt atttgaaggc cgctgcggcc 6900 
tgtcgagctg cgaagctcca ggactgcacg atgctcgtat gcggagacga ccttgtcgtt 6960 
atctgtgaaa gcgcggggac ccaagaggac gaggcgagcc tacgggcctt cacggaggct 7020 
atgactagat actctgcccc ccctggggac ccgcccaaac cagaatacga cttggagttg 7080 
ataacatcat gctcctccaa tgtgtcagtc gcgcacgatg catctggcaa aagggtgtac 7140 
tatctcaccc gtgaccccac cacccccctt gcgcgggctg cgtgggagac agctagacac 7200 
actccagtca attcctggct aggcaacatc atcatgtatg cgcccacctt gtgggcaagg 7260
atgatcctga tgactcattt cttctccatc cttctagctc aggaacaact tgaaaaagcc 7320 
ctagattgtc agatctacgg ggcctgttac tccattgagc cacttgacct acctcagatc 7380 
attcaacgac tccatggcct tagcgcattt tcactccata gttactctcc aggtgagatc 7440 
aatagggtgg cttcatgcct caggaaactt ggggtaccgc ccttgcgagt ctggagacat 7500 
cgggccagaa gtgtccgcgc taggctactg tcccaggggg ggagggctgc cacttgtggc 7560 
aagtacctct tcaactgggc agtaaggacc aagctcaaac tcactccaat cccggctgcg 7620 



tcccagttgg atttatccag ctggttcgtt gctggttaca gcgggggaga catatatcac 7680 
agcctgtctc gtgcccgacc ccgctggttc atgtggtgcc tactcctact ttctgtaggg 7740 
gtaggcatct atctactccc caaccgatga acggggagct aaacactcca ggccaatagg 7800 
ccatcctgtt tttttccctt tttttttttc tttttttttt tttttttttt tttttttttt 7860 
ttttctcctt tttttttcct ctttttttcc ttttctttcc tttggtggct ccatcttagc 7920
cctagtcacg gctagctgtg aaaggtccgt gagccgcttg actgcagaga gtgctgatac 7980 
tggcctctct gcagatcaag t 8001
<210> 7
<211> 11076 
<212> DNA 

<213> Hepatitis C Virus 



<400> 7
gbcagccfccc 
tcttcacgca 
cccccctccc 
gacgadcggg 
gcgagactgc 
gtgcttgcga 
ctcaaagaaa 
acgcaggttc 
caatcggctg 
ttgtcaagac 
cgtggctggc 
gaagggactg 
. ctcctgccga 
cggctacctg 
tggaagccgg 
ccgaactgtt 
atggcgatgc 
actgtggccg 
ttgctgaaga 
ctcccgattc 
cagaccacaa 
aacgttactg 
tccaccatat 
acgagcattc 
gtgaaggaag 
tgcaggcagc 
taagatacac 
gaaagagtca 
gtaccccatt 
tcgaggttaa 
cacgataata 
cgccgcccac 
ttgccgcgca 
caacctcgtig 
gctcagcccg 
ctcctgtcac 
cgcaatttgg 
attccgctcg 
gttctggagg 
ttccttttgg 
gtatccggag 
gcggacatga 
qgctgctggg 
acgatacgac 
gtgggagatc 
cggcacgaga 
cgtatggctt 
ttactccgga 
gcgggccttg 
ctctttgccg 
ctcgggatta 
aacggcagct 
ttccttgctg 
. gccagctgca 
tcacacagct 
gtacccgcgg 
gggacgaccg 
ctgcttctta 
agcactgggt 
aaaaccttga 



gattg^gg^c 
gaakgcgtct 
gggagagcca 
tcctttcttg 
tagccgagta 
gtgccccggg 
aaccaaacgt 
tccggccgct 
ctctgatgcc 
cgacctgtcc 
cacgacgggc 
gctgctattg 
gaaagtatcc 
cccattcgac 
tcttgtcgat 
cgccaggctc 
ctgcttgccg 
gctgggtgtg 
gcttggcggc 
gcagcgcatc 
cggtttccct 
gccgaagccg 
tgccgtcttt 
ctaggggtct 
cagttcctct 
ggaacccccc 
ctgcaaaggc 
aatggctctc 
gtatgggat<i 
aaaacgtcta 
ccatgggcac 
aggacgtcaa 

ggggccccag 

gaaggcgaca 
ggtacccctg 
cccgtggctc 
gtaaggtcat 
tcggcgcccc 
acggcgtgaa 
ctttgctgtc 
tgtaccatgt 
tcatgcatac 
tagcgctcac 
gccatgtcga 
tctgcggatc 
cagtacagga 
gggatatgat 
tcccacaagc 
cctactattc 
gcgttgacgg 
cgtccctctt 
ggcacatcaa 
cgctgttcta 
gccccatcga 
cggaccagag 
cgcaggtgtg 
accggttcgg 
acaacacgcg 
tcaccaagac 
cctgccccac 



gacact'ccac 
agccatggcg 
tagtggtctg 
gatcaacccg 
gtgttgggtc 
aggtctcgta 
aacaccaacg 
tgggtggaga 
gccgtgttcc 
ggtgccctga 
gttccttgcg 
ggcgaagtgc 
atcatggctg 
caccaagcga 
caggatgatc 
aaggcgcgca 
aatatcatgg 
gcggaccgct 
gaatgggctg 
gccttctatc 
ctagcgggat 
cttggaataa 
tggcaatgtg 
ttcccctctc 
ggaagcttct 
acctggcgac 
ggcacaaccc 
ctcaagcgta 
tgatctgggg 
ggccccccga 
gaatcctaaa 
gttcccgggc 
gttgggtgtg 
acctatcccc 
gcccctctat 
tcggcctagt 
cgataccctc 
cctagggggc 
ctatgcaaca 
ctgtttgacc~ 
cacgaacgac 
ccccgggtgc 
tcccacgctc 
tttgctcgtt 
tgttttcctc 
ctgcaattgc 
gatgaactgg 
tgtcgtggat 
catggtgggg 
gggaacctat 
ttcacccggg 
caggactgcc 
cgtgcacaag 
cgcgttcgct 
gccttattgt 
tggtccagtg 
cgtccctacg 
gccgccgcaa 
gtgcgggggc 
ggactgcttc 



catagatcac 
ttagtatgag 
cggaaccggt 
ctcaatgcct 
gcgaaaggcc 
gaccgtgcac 
ggcgcgccat 
ggctattcgg 
ggctgtcagc 
atgaactgca 
cagctgtgct 
c 99TO c agga 
atgcaatgcg 
aacatcgcat 
tggacgaaga 
tgcccgacgg 
tggaaaatgg 
atcaggacat 
accgcttcct 
gccttcttga 
caattccgcc 
ggccggtgtg 
agggcccgga 
gccaaaggaa 
tgaagacaaa 
aggtgcctct 
cagtgccacg 
ttcaacaagg 
cctcggtgca 
accacgggga 
cctcaaagaa 
ggtggtcaga 
cgcgcgacta 
aaggctcgcc 
ggcaatgagg 
tggggcccca 
acgtgcggct 
gctgccaggg 
gggaatctgc 
atcccagctt 
tgctccaacg 
gtgccctgcg 
gcggccagga 
ggggcggctg 
gtcgccca^c 
tcaatatatc 
tcacctacag 
atggtggcgg 
aactgggtta 
gtgacagggg 
tcatcccaga 
ctgaactgca 
ttcaactcat 
caggggtggg 
tggcactacg 
tactgcttca 
tacagttggg 
ggcaactggt 
cccccgtgta 
cggaagcacc 



tcccctgtga 
tgtcgtgcag 
gagtacaccg 
ggagatttgg 
ttgtggtact 
catgagcacg 
gattgaacaa 
ctatgactgg 
gcaggggcgc 
ggacgaggca 
cgacgttgtc 
tctcctgtca 
gcggctgcat 
cgagcgagca 
gcatcagggg 
cgaggatctc 
ccgcttttct 
agcgttggct 
cgtgctttac 
cgagttcttc 
cctctccctc 
cgtttgtcta 
aacctggccc 
tgcaaggtct 
caacgtctgt 
gcggccaaaa 
ttgtgagttg 
ggctgaagga 
catgctttac 
cgtggttttc 
aaaccaaacg 
tcgtcggtgg 
ggaagacttc 
agcccgaggg 
gcttggggtg 
cggacccccg 
tcgccgatct 
ccctggcgca 
ccggttgctc 
ccgcttatga 
caagcattgt 
ttcgggagaa 
acgctagcgt 
ctctctgctc 
tgttcacctt 
ccggccacgt 
cagccctagt 
gggcccattg 
aggttctgat 
ggacgatggc 
aaatccagct 
atgactccct 
ctggatgccc 
ggcccatcac 
caccccggcc 
ccccaagccc 
gggagaatga 
ttggctgtac 
acatcggggg 
ccgaggccac 



ggaactactg 60 
cctccaggac 120 
gaattgccag 180 
gcgtgccccc 240 
gcctgatagg 300 
aatcctaaac 360 
gatggattgc 420 
gcacaacaga 480 
ccggttcttt 540 
gcgcggctat 600 
actgaagcgg 660 
tctcaccttg 720 
acgcttgatc 780 
cgtactcgga 840 
ctcgcgccag 900 
gtcgtgaccc 960 
ggattcatcg 1020 
acccgtgata 1080 
ggtatcgccg 1140 
tgagtttaaa 1200 
ccccccccct 1260 
tatgttattt 1320 
tgtcttcttg 1380 
gttgaatgtc 1440 
agcgaccctt 1500 
gccacgtgta 1560 
gatagttgtg 1620 
tgcccagaag 1680 
atgtgtttag 1740 
ctttgaaaaa 1800 
taacaccaac 1860 
agtttacctg 1920 
cgagcggtcg 1980 
tagggcctgg 2040 
ggcaggatgg 2100 
gcgtaggtcg 2160 
catggggtac 2220 
tggcgtccgg 2280 
cttttctatc 2340 
agtgcgcaac 2400 
gtatgaggca 2460 
caactcctcc 2520 
ccccactacg 2580 
cgctatgtac 2640 
ctcgcctcgc 2700 
gacaggtcac 2760 
ggtatcgcag 2820 
gggagtccta 2880 
tgtgatgcta 2940 
caaaaacacc 3000 
tgtaaacacc 3060 
caacactggg 3120 
agagcgcatg 3180 
ttacaatgag 3240 
gtgcggtatc 3300 
tgtcgtggtg 3360 
gacggacgtg 3420 
atggatgaat 3480 
gatcggcaat 3540 
ttacaccaag 3600 
tgtggttcgg ggccttggtt gacacccaga tgcttggtcc actacccata caggctttgg 3660 
cactacccct gcactgtcaa ctttaccatc ttcaaggtta ggatgtacgt ggggggagtg 3720 
gagcacaggc tcgaagccgc atgcaattgg actcgaggag agcgttgtaa cctggaggac 3780 
agggacagat cagagcttag cccgctgctg ctgtctacaa cggagtggca ggtattgccc 3840 
tgttccttca ccaccctacc ggctctgtcc actggtttga tccatctcca tcagaacgtc 3900 
gtggacgtac aatacctgta cggtataggg tcggcggttg tctcctttgc aatcaaatgg 3960 
gagtatgtcc tgttgctctt ccttcttctg gcggacgcgc gcgtctgtgc ctgcttgtgg 4020 
atgatgctgc tgatagctca agctgaggcc gccctagaga acctggtggt cctcaacgcg 4080 
gcatccgtgg ccggggcgca tggcattctc tccttcctcg tgttcttctg tgctgcctgg 4140 
tacatcaagg gcaggctggt ccctggggcg gcatatgccc tctacggcgt atggccgcta 4200 
ctcctgctcc tgctggcgtt accaccacga gcatacgcca tggaccggga gatggcagca 4260 
tcgtgcggag gcgcggtttt cgtaggtctg atactcttga ccttgtcacc gcactataag 4320 
ctgttcctcg ctaggctcat atggtggtta caatatttta tcaccagggc cgaggcacac 4380 
ttgcaagtgt ggatcccccc cctcaacgtt cgggggggcc gcgatgccgt catcctcctc 4440 
acgtgcgcga tccacccaga gctaatcttt accatcacca aaatcttgct cgccatactc 4500 
ggtccactca tggtgctcca ggctggtata accaaagtgc cgtacttcgt gcgcgcacac 4560 
gggctcattc gtgcatgcat gctggtgcgg aaggttgctg ggggtcatta tgtccaaatg 4620 
gctctcatga agttggccgc actgacaggt acgtacgttt atgaccatct caccccactg 4680 
cgggactggg cccacgcggg cctacgagac cttgcggtgg cagttgagcc cgtcgtcttc 4740 
tctgatatgg agaccaaggt tatcacctgg ggggcagaca ccgcggcgtg tggggacatc 4800 
atcttgggcc tgcccgtctc cgcccgcagg gggagggaga tacatctggg accggcagac 4860 
agccttgaag ggcaggggtg gcgactcctc gcgcctatta cggcctactc ccaacagacg 4920 
cgaggcctac ttggctgcat catcactagc ctcacaggcc gggacaggaa ccaggtcgag 4980 
ggggaggtcc aagtggtctc caccgcaaca caatctttcc tggcgacctg cgtcaatggc 5040 
gtgtgttgga ctgtctatca tggtgccggc tcaaagaccc ttgccggccc aaagggccca 5100 
atcacccaaa tgtacaccaa tgtggaccag gacctcgtcg gctggcaagc gccccccggg 5160 
gcgcgttcct tgacaccatg cacctgcggc agctcggacc tttacttggt cacgaggcat 5220 
gccgatgtca ttccggtgcg ccggcggggc gacagcaggg ggagcctact ctcccccagg 5280 
cccgtctcct acttgaaggg ctcttcgggc ggtccactgc tctgcccctc ggggcatgct 5340 
gtgggcatct ttcgggctgc cgtgtgcacc cgaggggttg cgaaggcggt ggactttgta 5400 
cccgtcgagt ctatggaaac cactatgcgg tccccggtct tcacggacaa ctcgtcccct 5460 
ccggccgtac cgcagacatt ccaggtggcc catctacacg cccctactgg tagcggcaag 5520 
agcactaagg tgccggctgc gtatgcagcc caagggtata aggtgcttgt cctgaacccg 5580 
tccgtcgccg ccaccctagg tttcggggcg tatatgtcta aggcacatgg tatcgaccct 5640 
aacatcagaa ccggggtagg gaccatcacc acgggtgccc ccatcacgta ctccacctat 5700 
ggcaagtttc ttgccgacgg tggttgctct gggggcgcct atgacatcat aatatgtgat 5760 
gagtgccact caactgactc gaccactatc ctgggcatcg gcacagtcct ggaccaagcg 5820 
gagacggctg gagcgcgact cgtcgtgctc gccaccgcta cgcctccggg atcggtcacc 5880 
gtgccacatc caaacatcga ggaggtggct ctgtccagca ctggagaaat ccccttttat 5940 
ggcaaagcca tccccatcgc gaccatcaag ggggggaggc acctcatttt ctgccattcc 6000 
aagaagaaat gtgatgagct cgccgcgaag ctatccggcc tcggactcaa tgctgtagca 6060 
tattaccggg gccttgatgt atccgtcata ccaactagcg gagacgtcat tgtcgtagca 6120 
acggacgctc taatgacggg ctttaccggc gatttcgact cagtgatcga ctgcaataca 6180 
tgtgtcaccc agacagtcga cttcagcctg gacccgacct tcaccattga gacgacgacc 6240 
gtgccacaag acgcggtgtc acgctcgcag cggcgaggca ggactggtag gggcaggatg 6300 
ggcatttaca ggtttgtgac tccaggagaa cggccctcgg gcatgttcga ttcctcggtt 6360 
ctgtgcgagt gctatgacgc gggctgtgct tggtacgagc tcacgcccgc cgagacctca 6420 
gttaggttgc gggcttacct aaacacacca gggctgcccg tctgccagga ccatctggag 6480 
ttctgggaga gcgtctttac aggcctcacc cacatagacg cccatttctt gtcccagact 6540 
aggcaggcag gagacaactt cccctacctg gtagcatacc aggctacggt gtgcgccagg 6600 
gctcaggctc cacctccatc gtgggaccaa atgtgggagt gtctcatacg gctaaagcct 6660 
acgctgcacg ggccaacgcc cctgctgtat aggctgggag ccgttcaaaa cgaggttact 6720 
accacacacc ccataaccaa atacatcatg gcatgcatgt cggctgacct ggaggtcgtc 6780 
acgagcacct gggtgctggt aggcggagtc ctagcagctc tggccgcgta ttgcctgaca 6840 
acaggcagcg tggtcattgt gggcaggatc atcttgtccg gaaagccggc catcattccc 6900 
gacagggaag tcctttaccg ggagttcgat gagatggaag agtgcgcctc acacctccct 6960 
tacatcgaac agggaatgca gctcgccgaa caattcaaac agaaggcaat cgggttgctg 7020 
caaacagcca ccaagcaagc ggaggctgct gctcccgtgg tggaatccaa gtggcggacc 7080 
ctcgaagcct tctgggcgaa gcatatgtgg aatttcatca gcgggataca atatttagca 7140 
ggcttgtcca ctctgcctgg caaccccgcg atagcatcac tgatggcatt cacagcctct 7200 
atcaccagcc cgctcaccac ccaacatacc ctcctgttta acatcctggg gggatgggtg 7260 
gccgcccaac ttgctcctcc cagcgctgct tctgctttcg taggcgccgg catcgctgga 7320 
gcggctgttg gcagcatagg ccttgggaag gtgcttgtgg atattttggc aggttatgga 7380 
gcaggggtgg caggcgcgct cgtggccttt aaggtcatga gcggcgagat gccctccacc 7440 
gaggacctgg ttaacctact ccctgctatc ctctcccctg gcgccctagt cgtcggggtc 7500 
gtgtgcgcag cgatactgcg tcggcacgtg ggcccagggg agggggctgt gcagtggatg 7560 
aaccggctga tagcgttcgc ttcgcggggt aaccacgtct cccccacgca ctatgtgtct 7620 
gagagcgacg ctgcagcacg tgtcactcag atcctctcta gtcttaccat cactcagctg 7680 
ctgaagaggc ttcaccagtg gatcaacgag gactgctcca cgccatgctc cggctcgtgg 7740 
ctaagagatg tttgggattg gatatgcacg gtgttgactg atttcaagac ctggctccag 7800 
tccaagctcc tgccgcgatt gccgggagtc cccttcttct catgtcaacg tgggtacaag 7860 
ggagtctggc ggggcgacgg catcatgcaa accacctgcc catgtggagc acagatcacc 7920 
ggacatgtga aaaacggttc catgaggatc gtggggccta ggacctgtag taacacgtgg 7980 
catggaacat tccccattaa cgcgtacacc acgggcccct gcacgccctc cccggcgcca 8040 
aattattcta gggcgctgtg gcgggtggct gctgaggagt acgtggaggt tacgcgggtg 8100 
ggggatttcc actacgtgac gggcatgacc actgacaacg taaagtgccc gtgtcaggtt 8160 
ccggcccccg aattcttcac agaagtggat ggggtgcggt tgcacaggta cgctccagcg 8220 
tgcaaacccc tcctacggga ggaggtcaca ttcctggtcg ggctcaatca atacctggtt 8280 
gggtcacagc tcccatgcgg gcccgaaccg gacgtagcag tgctcacttc catgctcacc 8340 
gacccctccc acattacggc ggagacggct aagcgtaggc tggccagggg atctcccccc 8400 
tccttggcca gctcatcagc tagccagctg tctgcgcctt ccttgaaggc aacatgcact 8460 
acccgtcatg actccccgga cgctgacctc atcgaggcca acctcctgtg gcggcaggag 8520 
atgggcggga acatcacccg cgtggagtca gaaaataagg tagtaatttt ggactctttc 8580 
gagccgctcc aagcggagga ggatgagagg gaagtatccg ttccggcgga gatcctgcgg 8640 
aggtccagga aattccctcg agcgatgccc atatgggcac gcccggatta caaccctcca 8700 
ctgttagagt cctggaagga cccggactac gtccctccag tggtacacgg gtgtccattg 8760 
ccgcctgcca aggcccctcc gataccacct ccacggaggg agaggacggt tgtcctgtca 8820 
gaatctaccg tgtcttctgc cttggcggag ctcgccacaa agaccttcgg cagctccgaa 8880 
tcgtcggccg tcgacagcgg cacggcaacg gcctctcctg accagccctc cgacgacggc 8940 
gacgcgggat ccgacgttga gtcgtactcc tccatgcccc cccttgaggg ggagccgggg 9000 
gatcccgatc tcagcgacgg gtcttggtct accgtaagcg aggaggctag tgaggacgtc 9060 
gtctgctgct cgatgtccta cacatggaca ggcgccctga tcacgccatg cgctgcggag 9120 
gaaaccaagc tgcccgtcaa tgcactgagc aactctttgc tccgtcacca caacttggtc 9180 
tatgctacaa catctcgcag cgcaagcctg cggcagaaga aggtcacctt tgacagactg 9240 
caggtcctgg acgaccacta ccgggacgtg ctcaaggaga tgaaggcgaa ggcgtccaca 9300 
gttaaggcta aacttctatc cgtggaggaa gcctgtaagc tgacgccccc acattcggcc 9360 
agatctaaat ttggctatgg ggcaaaggac gtccggaacc tatccagcaa ggccgttaac 9420 
cacatccgct ccgtgtggaa ggacttgctg gaagacactg agacaccaat tgacaccacc 9480 
atcatggcaa aaaatgaggt tttctgcgtc caaccagaga aggggggccg caagccagct 9540 
cgccttatcg tattcccaga tttgggggtt cgtgtgtgcg agaaaatggc cctttacgat 9600 
gtggtctcca ccctccctca ggccgtgatg ggctcttcat acggattcca atactctcct 9660 
ggacagcggg tcgagttcct ggtgaatgcc tggaaagcga agaaatgccc tatgggcttc 9720 
gcatatgaca cccgctgttt tgactcaacg gtcactgaga atgacatccg tgttgaggag 9780 
tcaatctacc aatgttgtga cttggccccc gaagccagac aggccataag gtcgctcaca 9840 
gagcggcttt acatcggggg ccccctgact aattctaaag ggcagaactg cggctatcgc 9900 
cggtgccgcg cgagcggtgt actgacgacc agctgcggta ataccctcac atgttatttg 9960 
aaggccgctg cggcctgtcg agctgcgaag ctccaggact gcacgatgct cgtatgcgga 10020 
gacgaccttg tcgttatctg tgaaagcgcg gggacccaag aggacgaggc gagcctacgg 10080 
gccttcacgg aggctatgac tagatactct gccccccctg gggacccgcc caaaccagaa 10140 
tacgacttgg agttgataac atcatgctcc tccaatgtgt cagtcgcgca cgatgcatct 10200 
ggcaaaaggg tgtactatct cacccgtgac cccaccaccc cccttgcgcg ggctgcgtgg 10260 
gagacagcta gacacactcc agtcaattcc tggctaggca acatcatcat gtatgcgccc 10320 
accttgtggg caaggatgat cctgatgact catttcttct ccatccttct agctcaggaa 10380 
caacttgaaa aagccctaga ttgtcagatc tacggggcct gttactccat tgagccactt 10440 
gacctacctc agatcattca acgactccat ggccttagcg cattttcact ccatagttac 10500 
tctccaggtg agatcaatag ggtggcttca tgcctcagga aacttggggt accgcccttg 10560 
cgagtctgga gacatcgggc cagaagtgtc cgcgctaggc tactgtccca gggggggagg 10620 
gctgccactt gtggcaagta cctcttcaac tgggcagtaa ggaccaagct caaactcact 10680 
ccaatcccgg ctgcgtccca gttggattta tccagctggt tcgttgctgg ttacagcggg 10740 
ggagacatat atcacagcct gtctcgtgcc cgaccccgct ggttcatgtg gtgcctactc 10800 
ctactttctg taggggtagg catctatcta ctccccaacc gatgaacggg gagctaaaca 10860 
ctccaggcca ataggccatc ctgttttttt cccttttttt ttttcttttt tttttttttt 10920 
tttttttttt tttttttttc tccttttttt ttcctctttt tttccttttc tttcctttgg 10980 
tggctccatc ttagccctag tcacggctag ctgtgaaagg tccgtgagcc gcttgactgc 11040 
agagagtgct gatactggcc tctctgcaga tcaagt 11076 
<210> 8 
<211> 8001 
<212> DNA 

<213> Hepatitis C Virus 
<400> 8 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 420 
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 480 
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 540 
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 600 
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 660 
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 720 
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 780 
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 840 
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 900 
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 960 
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 1020 
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 1080 
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 1140 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 1200 
cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 1260 
aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 1320 
tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 1380 
acgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 1440 
gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 1500 
tgcaggcagc ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta 1560 
taagatacac ctgcaaaggc ggcataaccc cagtgccacg ttgtgagttg gatagttgtg 1620 
gaaagagtca aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 1680 
gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 1740 
tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1800 
cacgataata ccatggcgcc tattacggcc tactcccaac agacgcgagg cctacttggc 1860 
tgcatcatca ctagcctcac aggccgggac aggaaccagg tcgaggggga ggtccaagtg 1920 
gtctccaccg caacacaatc tttcctggcg acctgcgtca atggcgtgtg ttggactgtc 1980 
tatcatggtg ccggctcaaa gacccttgcc ggcccaaagg gcccaatcac ccaaatgtac 2040 
accaatgtgg accaggacct cgtcggctgg caagcgcccc ccggggcgcg ttccttgaca 2100 
ccatgcacct gcggcagctc ggacctttac ttggtcacga ggcatgccga tgtcattccg 2160 
gtgcgccggc ggggcgacag cagggggagc ctactctccc ccaggcccgt ctcctacttg 2220 
aagggctctt cgggcggtcc actgctctgc ccctcggggc acgctgtggg catctttcgg 2280 
gctgccgtgt gcacccgagg ggttgcgaag gcggtggact ttgtacccgt cgagtctatg 2340 
gaaaccacta tgcggtcccc ggtcttcacg gacaactcgt cccctccggc cgtaccgcag 2400 
acattccagg tggcccatct acacgcccct actggtagcg gcaagagcac taaggtgccg 2460 
gctgcgtatg cagcccaagg gtataaggtg cttgtcctga acccgtccgt cgccgccacc 2520 
ctaggtttcg gggcgtatat gtctaaggca catggtatcg accctaacat cagaatcggg 2580 
gtaaggacca tcaccacggg tgcccccatc acgtactcca cctatggcaa gtttcttgcc 2640 
gacggtggtt gctctggggg cgcctatgac atcataatat gtgatgagtg ccactcaact 2700 
gactcgacca ctatcctggg catcggcaca gtcctggacc aagcggagac ggctggagcg 2760 
cgactcgtcg tgctcgccac cgctacgcct ccgggatcgg tcaccgtgcc acatccaaac 2820 
atcgaggagg tggctctgtc cagcactgga gaaatcccct tttatggcaa agccatcccc 2880 
atcgagacca tcaagggggg gaggcacctc attttctgcc attccaagaa gaaatgtgat 2940 
gagctcgccg cgaagctgtc cggcctcgga ctcaatgctg tagcatatta ccggggcctt 3000 
gatgtatccg tcataccaac tagcggagac gtcattgtcg tagcaacgga cgctctaatg 3060 
acgggcttta ccggtgactt cgactcagtg atcgactgca atacatgtgt cacccagaca 3120 
gtcgacttca gcctggaccc gaccttcacc attgagacga cgaccgtgcc acaagacgcg 3180 
gtgtcacgct cgcagcggcg aggcaggact ggtaggggca ggatgggcat ttacaggttt 3240 
gtgactccag gagaacggcc ctcgggcatg ttcgattcct cggttctgtg cgagtgctat 3300 
gacgcgggct gtgcttggta cgagctcacg cccgccgaga cctcagttag gttgcgggct 3360 
tacctaaaca caccagggtt gcccgtctgc caggaccatc tggagttctg ggagagcgtc 3420 
tttacaggcc tcacccacat agacgcccat ttcttgtccc agactaagca ggcaggagac 3480 
aacttcccct acctggtagc ataccaggct acggtgtgcg ccagggctca ggctccacct 3540 
ccatcgtggg accaaatgtg gaagtgtctc atacggctaa agcctacgct gcacgggcca 3600 
acgcccctgc tgtataggct gggagccgtt caaaacgagg ttactaccac acaccccata 3660 
accaaataca tcatggcatg catgtcggct gacctggagg tcgtcacgag cacctgggtg 3720 
ctggtaggcg gagtcctagc agctctggcc gcgtattgcc tgacaacagg cagcgtggtc 3780 
attgtgggca ggatcatctt gtccggaaag ccggccatca ttcccgacag ggaagtcctt 3840 
taccgggagt tcgatgagat ggaagagtgc gcctcacacc tcccttacat cgaacaggga 3900 
atgcagctcg ccgaacaatt caaacagaag gcaatcgggt tgctgcaaac agccaccaag 3960 
caagcggagg ctgctgctcc cgtggtggaa tccaagtggc ggaccatcga agccttctgg 4020 
gcgaagcata tgtggaattt catcagcggg atacaatatt tagcaggctt gtccactctg 4080 
cctggcaacc ccgcgatagc atcactgatg gcattcacag cctctatcac cagcccgctc 4140 
accacccaac ataccctcct gtttaacatc ctggggggat gggtggccgc ccaacttgct 4200 
cctcccagcg ctgcttctgc tttcgtaggc gccggcatcg ctggagcggc tgttggcagc 4260 
ataggccttg ggaaggtgct tgtggatatt ttggcaggtt atggagcagg ggtggcaggc 4320 
gcgctcgtgg cctttaaggt catgagcggc gagatgccct ccaccgagga cctggttaac 4380 
ctactccctg ctatcctctc ccctggcgcc ctagtcgtcg gggtcgtgtg cgcagcgata 4440 
ctgcgtcggc acgtgggccc aggggagggg gctgtgcagt ggatgaaccg gctgatagcg 4500 
ttcgcttcgc ggggtaacca cgtctccccc acgcactatg tgcctgagag cgacgctgca 4560 
gcacgtgtca ctcagatcct ctctagtctt accatcactc agctgctgaa gaggcttcac 4620 
cagtggatca acgaggactg ctccacgcca tgctccggct cgtggctaag agatgtttgg 4680 
gattggatat gcacggtgtt gactgatttc aagacctggc tccagtccaa gctcctgccg 4740 
cgattgccgg gagtcccctt cttctcatgt caacgtgggt acaagggagt ctggcggggc 4800 
gacggcatca tgcaaaccac ctgcccatgt ggggcacaga tcaccggaca tgtgaaaaac 4860 
ggttccatga ggatcgtggg gcctaggacc tgtagtaaca cgtggcatgg aacattcccc 4920 
attaacgcgt acaccacggg cccctgcacg ccctccccgg cgccaaatta ttctagggcg 4980 
ctgtggcggg tggctgctga ggagtacgtg gaggttacgc gggtggggga tttccactac 5040 
gtgacgggca tgaccactga cgacgtaaag tgcccgtgtc aggttccggc ccccgaattc 5100 
ttcacagaag tggatggggt gcggttgcac aggtacgctc cagcgtgcaa acccctccta 5160 
cgggaggagg tcacattcct ggtcgggctc aatcaatacc tggttgggtc acagctccca 5220 
tgcgagcccg aaccggatgt agcagtgctc acttccatgc tcaccgaccc ctcccacatt 5280 
acggcggaga cggctaagcg taggctggcc aggggatctc ctcccccctt ggccagctca 5340 
tcagctagcc agctgtctgc gccttccttg aaggcaacat gcactacccg tcatgactcc 5400 
ccggacgctg acctcatcga ggccaacctc ctgtggcggc aggagatggg cgggaacatc 5460 
acccgcgtgg agtcagaaaa taaggtagta attttggact ctttcgagcc gctccaagcg 5520 
gaggaggatg agagggaagt atccgttccg gcggagatcc tgcggaggtc caggaaattc 5580 
cctcgagcga tgcccatatg ggcacgcccg gattacaacc ctccactgtt agagtcctgg 5640 
aaggacccgg actacgtccc tccagtggta cacgggtgtc cattgccgcc tgccaaggcc 5700 
cctccgatac caccttcacg gaggaagagg acggttgtcc tgtcagaatc taccgtgtct 5760 
tctgccttgg cggagctcgc cacagagacc ttcggcagct ccgaatcgtc ggccgtcgac 5820 
agcggcacgg caacggcctc tcctgaccag ccctccgacg acggcgacgc gggatccgac 5880 
gttgagtcgt actcctccat gccccccctt gagggggagc cgggggatcc cgatctcagc 5940 
gacgggtctt ggtctaccgt aagcgaggag gctagtgagg acgtcgtctg ctgctcgatg 6000 
tcctacacat ggacaggcgc cctgatcacg ccatgcgctg cggaggaaac caagctgccc 6060 
atcaatgcac tgagcaactc tttgctccgt caccacaact tggtctatgc tacaacatct 6120 
cgcagcgcaa gcctgcggca gaagaaggtc acctttgaca gactgcaggt cctggacgac 6180 
cactaccggg acgtgctcaa ggagatgaag gcgaaggcgt ccacagttaa ggctaaactt 6240 
ctatccgtgg aggaagcctg taagctgacg cccccacatt cggccagatc taaatttggc 6300 
tatggggcaa aggacgtccg gaacctatcc agcaaggccg ttaaccacat ccgctccgtg 6360 
tggaaggact tgctggaaga cactgagaca ccaattgaca ccaccatcat ggcaaaaaat 6420 
gaggttttct gcgtccaacc agagaagggg ggccgcaagc cagctcgcct tatcgtattc 6480 
ccagatttgg gggttcgtgt gtgcgagaaa atggcccttt acgatgtggt ctccaccctc 6540 
cctcaggccg tgatgggctc ttcatacgga ttccaatact ctcctggaca gcgggtcgag 6600 
ttcctggtga atgcctggaa agcgaagaaa tgccctatgg gcttcgcata tgacacccgc 6660 
tgttttgact caacggtcac tgagaatgac atccgtgttg aggagtcaat ctaccaatgt 6720 
tgtgacttgg cccccgaagc cagacaggcc ataaggtcgc tcacagagcg gctttacatc 6780 
gggggccccc tgactaattc taaagggcag aactgcggct atcgccggtg ccgcgcgagc 6840 
ggtgtactga cgaccagctg cggtaatacc ctcacatgtt acttgaaggc cgctgcggcc 6900 
tgtcgagctg cgaagctcca ggactgcacg atgctcgtat gcggagacga ccttgtcgtt 6960 
atctgtgaaa gcgcggggac ccaagaggac gaggcgagcc tacgggcctt cacggaggct 7020 
atgactagat actctgcccc ccctggggac ccgcccaaac cagaatacga cttggagttg 7080 
ataacatcat gctcctccaa tgtgtcagtc gcgcacgatg catctggcaa aagggtgtac 7140 
tatctcaccc gtgaccccac cacccccctt gcgcgggctg cgtgggagac agctagacac 7200 
actccagtca attcctggct aggcaacatc atcatgtatg cgcccacctt gtgggcaagg 7260 
atgatcctga tgactcattt cttctccatc cttctagctc aggaacaact tgaaaaagcc 7320 
ctagattgtc agatctacgg ggcctgttac tccattgagc cacttgacct acctcagatc 7380 
attcaacgac tccatggcct tagcgcattt tcactccata gttactctcc aggtgagatc 7440 
aatagggtgg cttcatgcct caggaaactt ggggtaccgc ccttgcgagt ctggagacat 7500 
cgggccagaa gtgtccgcgc taggctactg tcccaggggg ggagggctgc cacttgtggc 7560 
aagtacctct tcaactgggc agtaaggacc aagctcaaac tcactccaat cccggctgcg 7620 
tcccagttgg atttatccag ctggttcgtt gctggttaca gcgggggaga catatatcac 7680 
agcctgtctc gtgcccgacc ccgctggttc atgtggtgcc tactcctact ttctgtaggg 7740 
gtaggcatct atctactccc caaccgatga acggggagct aaacactcca ggccaatagg 7800 
ccatcctgtt tttttccctt tttttttttc tttttttttt tttttttttt tttttttttt 7860 
ttttctcctt tttttttcct ctttttttcc ttttctttcc tttggtggct ccatcttagc 7920 
cctagtcacg gctagctgtg aaaggtccgt gagccgcttg actgcagaga gtgctgatac 7980 
tggcctctct gcagatcaag t 8001 
<210> 9 
<211> 11076 
<212> DNA 

<213> Hepatitis C Virus 
<400> 9 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 420 
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 480 
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 540 
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 600 
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 660 
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 720 
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 780 
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 840 
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 900 
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 960 
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 1020 
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 1080 
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 1140 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 1200 
cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 1260 
aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 1320 
tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 1380 
atgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 1440 
gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 1500 
tgcaggcagc ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta 1560 
taagatacac ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg 1620 
gaaagagtca aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 1680 
gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 1740 
tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1800 
cacgataata ccatgggcac gaatcctaaa cctcaaagaa aaaccaaacg taacaccaac 1860 
cgccgcccac aggacgtcaa gttcccgggc ggtggtcaga tcgtcggtgg agtttacctg 1920 
ttgccgcgca ggggccccag gttgggtgtg cgcgcgacta ggaagacttc cgagcggtcg 1980 
caacctcgtg gaaggcgaca acctatcccc aaggctcgcc agcccgaggg tagggcctgg 2040 
gctcagcccg ggtacccctg gcccctctat ggcaatgagg gcttggggtg ggcaggatgg 2100 
ctcctgtcac cccgtggctc tcggcctagt tggggcccca cggacccccg gcgtaggtcg 2160 
cgcaatttgg gtaaggtcat cgataccctc acgtgcggct tcgccgatct catggggtac 2220 
attccgctcg tcggcgcccc cctagggggc gctgccaggg ccctggcgca tggcgtccgg 2280 
gttctggagg acggcgtgaa ctatgcaaca gggaatctgc ccggttgctc cttttctatc 2340 
ttccttttgg ctttgctgtc ctgtttgacc atcccagctt ccgcttatga agtgcgcaac 2400 
gtatccggag tgtaccatgt cacgaacgac tgctccaacg caagcattgt gtatgaggca 2460 
gcggacatga tcatgcatac ccccgggtgc gtgccctgcg ttcgggagaa caactcctcc 2520 
cgctgctggg tagcgctcac tcccacgctc gcggccagga acgctagcgt ccccactacg 2580 
acgatacgac gccatgtcga tttgctcgtt ggggcggctg ctctctgctc cgctatgtac 2640 
gtgg^agatc tctgcggatc tgttttcctc gtcgcccagc tgttcacctt ctcgcctcgc 2700 
cggcacgaga cagtacagga ctgcaattgc tcaatatatc ccggccacgt gacaggtcac 2760 
cgtatggctt gggatatgat gatgaactgg tcacctacag cagccctagt ggtatcgcag 2820 
ttactccgga tcccacaagc tgtcgtggat atggtggcgg gggcccattg gggagtccta 2880 
gcgggccttg cctactattc catggtgggg aactgggcta aggttctgat tgtgatgcta 2940 
ctctttgccg gcgttgacgg gggaacctat gtgacagggg ggacgatggc caaaaacacc 3000 
ctcgggatta cgtccctctt ttcacccggg tcatcccaga aaatccagct tgtaaacacc 3060 
aacggcagct ggcacatcaa caggactgcc ctgaactgca atgactccct caacactggg 3120 
ttccttgctg cgctgttcta cgtgcacaag ttcaactcat ctggatgccc agagcgcatg 3180 
gccagctgca gccccatcga cgcgttcgct caggggtggg ggcccatcac ttacaatgag 3240 
tcacacagct cggaccagag gccttattgt tggcactacg caccccggcc gtgcggtatc 3300 
gtacccgcgg cgcaggtgtg tggtccagtg tactgcttca ccccaagccc tgtcgtggtg 3360 
gggacgaccg accggttcgg cgtccctacg tacagttggg gggagaatga gacggacgtg 3420 
ctgcttctta acaacacgcg gccgccgcaa ggcaactggt ttggctgtac atggatgaat 3480 
agcactgggt tcaccaagac gtgcgggggc cccccgtgta acatcggggg gatcggcaat 3540 
aaaaccttga cctgccccac ggactgcttc cggaagcacc ccgaggccac ttacaccaag 3600 
tgtggttcgg ggccttggtt gacacccaga tgcttggtcc actacccata caggctttgg 3660 
cactacccct gcactgtcaa ctttaccatc ttcaaggtta ggatgtacgt ggggggagtg 3720 
gagcacaggc tcgaagccgc atgcaattgg actcgaggag agcgttgtaa cctggaggac 3780 
agggacagat cagagcttag cccgctgctg ctgtctacaa cggagtggca ggtattgccc 3840 
tgttccttca ccaccctacc ggctctgtcc actggtttga tccatctcca tcagaacgtc 3900 
gtggacgtac aatacctgta cggtataggg tcggcggttg tctcctttgc aatcaaatgg 3960 
gagtatgtcc tgttgctctt ccttcttctg gcggacgcgc gcgtctgtgc ctgcttgtgg 4020 
atgatgctgc tgatagctca agctgaggcc gccctagaga acctggtggt cctcaacgcg 4080 
gcatccgtgg ccggggcgca tggcattctc tccttcctcg tgttcttctg tgctgcctgg 4140 
tacatcaagg gcaggctggt ccctggggcg gcatatgccc tctacggcgt atggccgcta 4200 
ctcctgctcc tgctggcgtt accaccacga gcatacgcca tggaccggga gatggcagca 4260 
tcgtgcggag gcgcggtttt cgtaggtctg atactcttga ccttgtcacc gcactataag 4320 
ctgttcctcg ctaggctcat atggtggtta caatatttta tcaccagggc cgaggcacac 4380 
ttgcaagtgt ggatcccccc cctcaacgtt cgggggggcc gcgatgccgt catcctcctc 4440 
acgtgcgcga tccacccaga gctaatcttt accatcacca aaatcttgct cgccatactc 4500 
ggtctactca tggtgctcca ggctggtata accaaagtgc cgtacttcgt gcgcgcacac 4560 
gggctcattc gtgcatgcat gctggtgcgg aaggttgctg ggggtcatta tgtccaaatg 4620 
gctctcatga agttggccgc actgacaggt acgtacgttt atgaccatct caccccactg 4680 
cgggactggg cccacgcggg cctacgagac cttgcggtgg cagttgagcc cgtcgtcttc 4740 
tctgatatgg agaccaaggt tatcacctgg ggggcagaca ccgcggcgtg tggggacatc 4800 
atcttgggcc tgcccgtctc cgcccgcagg gggagggaga tacatctggg accggcagac 4860 
agccttgaag ggcaggggtg gcgactcctc gcgcctatta cggcctactc ccaacagacg 4920 
cgaggcctac ttggctgcat catcactagc ctcacaggcc gggacaggaa ccaggtcgag 4980 
ggggaggtcc aagtggtctc caccgcaaca caatctttcc tggcgacctg cgtcaatggc 5040 
gtgtgttgga ctgtctatca tggtgccggc tcaaagaccc ttgccggccc aaagggccca 5100 
atcacccaaa tgtacaccaa tgtggaccag gacctcgtcg gctggcaagc gccccccggg 5160 
gcgcgttcct tgacaccatg cacctgcggc agctcggacc tttacttggt cacgaggcat 5220 
gccgatgtca ttccggtgcg ccggcggggc gacagcaggg ggagcctact ctcccccagg 5280 
cccgtctcct acttgaaggg ctcttcgggc ggtccactgc tctgcccctc ggggcacgct 5340 
gtgggcatct ttcgggctgc cgtgtgcacc cgaggggttg cgaaggcggt ggactttgta 5400 
cccgtcgagt ctatggaaac cactatgcgg tccccggtct tcacggacaa ctcgtcccct 5460 
ccggccgtac cgcagacatt ccaggtggcc catctacacg cccctactgg tagcggcaag 5520 
agcactaagg tgccggctgc gtatgcagcc caagggtata aggtgcttgt cctgaacccg 5580 
tccgtcgccg ccaccctagg tttcggggcg tatatgtcta aggcacatgg tatcgaccct 5640 
aacatcagaa tcggggtaag gaccatcacc acgggtgccc ccatcacgta ctccacctat 5700 
ggcaagtttc ttgccgacgg tggttgctct gggggcgcct atgacatcat aatatgtgat 5760 
gagtgccact caactgactc gaccactatc ctgggcatcg gcacagtcct ggaccaagcg 5820 
gagacggctg gagcgcgact cgtcgtgctc gccaccgcta cgcctccggg atcggtcacc 5880 
gtgccacatc caaacatcga ggaggtggct ctgtccagca ctggagaaat ccccttttat 5940 
ggcaaagcca tccccatcga gaccatcaag ggggggaggc acctcatttt ctgccattcc 6000 
aagaagaaat gtgatgagct cgccgcgaag ctgtccggcc tcggactcaa tgctgtagca 6060 
tattaccggg gccttgatgt atccgtcata ccaactagcg gagacgtcat tgtcgtagca 6120 
acggacgctc taatgacggg ctttaccggt gacttcgact cagtgatcga ctgcaataca 6180 
tgtgtcaccc agacagtcga cttcagcctg gacccgacct tcaccattga gacgacgacc 6240 
gtgccacaag acgcggtgtc acgctcgcag cggcgaggca ggactggtag gggcaggatg 6300 
ggcatttaca ggtttgtgac tccaggagaa cggccctcgg gcatgttcga ttcctcggtt 6360 
ctgtgcgagt gctatgacgc gggctgtgct tggtacgagc tcacgcccgc cgagacctca 6420 
gttaggttgc gggcttacct aaacacacca gggttgcccg tctgccagga ccatctggag 6480 
ttctgggaga gcgtctttac aggcctcacc cacatagacg cccatttctt gtcccagact 6540 
aagcaggcag gagacaactt cccctacctg gtagcatacc aggctacggt gtgcgccagg 6600 
gctcaggctc cacctccatc gtgggaccaa atgtggaagt gtctcatacg gctaaagcct 6660 
acgctgcacg ggccaacgcc cctgctgtat aggctgggag ccgttcaaaa cgaggttact 6720 
accacacacc ccataaccaa atacatcatg gcatgcatgt cggctgacct ggaggtcgtc 6780 
acgagcacct gggtgctggt aggcggagtc ctagcagctc tggccgcgta ttgcctgaca 6840 
acaggcagcg tggtcattgt gggcaggatc atcttgtccg gaaagccggc catcattccc 6900 
gacagggaag tcctttaccg ggagttcgat gagatggaag agtgcgcctc acacctccct 6960 
tacatcgaac agggaatgca gctcgccgaa caattcaaac agaaggcaat cgggttgctg 7020 
caaacagcca ccaagcaagc ggaggctgct gctcccgtgg tggaatccaa gtggcggacc 7080 
atcgaagcct tctgggcgaa gcatatgtgg aatttcatca gcgggataca atatttagca 7140 
ggcttgtcca ctctgcctgg caaccccgcg atagcatcac tgatggcatt cacagcctct 7200 
atcaccagcc cgctcaccac ccaacatacc ctcctgttta acatcctggg gggatgggtg 7260 
gccgcccaac ttgctcctcc cagcgctgct tctgctttcg taggcgccgg catcgctgga 7320 
gcggctgttg gcagcatagg ccttgggaag gtgcttgtgg atattttggc aggttatgga 7380 
gcaggggtgg caggcgcgct cgtggccttt aaggtcatga gcggcgagat gccctccacc 7440 
gaggacctgg ttaacctact ccctgctatc ctctcccctg gcgccctagt cgtcggggtc 7500 
gtgtgcgcag cgatactgcg tcggcacgtg ggcccagggg agggggctgt gcagtggatg 7560 
aaccggctga tagcgttcgc ttcgcggggt aaccacgtct cccccacgca ctatgtgcct 7620 
gagagcgacg ctgcagcacg tgtcactcag atcctctcta gtcttaccat cactcagctg 7680 
ctgaagaggc ttcaccagtg gatcaacgag gactgctcca cgccatgctc cggctcgtgg 7740 
ctaagagatg tttgggattg gatatgcacg gtgttgactg atttcaagac ctggctccag 7800 
tccaagctcc tgccgcgatt gccgggagtc cccttcttct catgtcaacg tgggtacaag 7860 
ggagtctggc ggggcgacgg catcatgcaa accacctgcc catgtggggc acagatcacc 7920 
ggacatgtga aaaacggttc catgaggatc gtggggccta ggacctgtag taacacgtgg 7980 
catggaacat tccccattaa cgcgtacacc acgggcccct gcacgccctc cccggcgcca 8040 
aattattcta gggcgctgtg gcgggtggct gctgaggagt acgtggaggt tacgcgggtg 8100 
ggggatttcc actacgtgac gggcatgacc actgacgacg taaagtgccc gtgtcaggtt 8160 
ccggcccccg aattcttcac agaagtggat ggggtgcggt tgcacaggta cgctccagcg 8220 
tgcaaacccc tcctacggga ggaggtcaca ttcctggtcg ggctcaatca atacctggtt 8280 
gggtcacagc tcccatgcga gcccgaaccg gatgtagcag tgctcacttc catgctcacc 8340 
gacccctccc acattacggc ggagacggct aagcgtaggc tggccagggg atctcctccc 8400 
cccttggcca gctcatcagc tagccagctg tctgcgcctt ccttgaaggc aacatgcact 8460 
acccgtcatg actccccgga cgctgacctc atcgaggcca acctcctgtg gcggcaggag 8520 
atgggcggga acatcacccg cgtggagtca gaaaataagg tagtaatttt ggactctttc 8580 
gagccgctcc aagcggagga ggatgagagg gaagtatccg ttccggcgga gatcctgcgg 8640 
aggtccagga aattccctcg agcgatgccc atatgggcac gcccggatta caaccctcca 8700 
ctgttagagt cctggaagga cccggactac gtccctccag tggtacacgg gtgtccattg 8760 
ccgcctgcca aggcccctcc gataccacct tcacggagga agaggacggt tgtcctgtca 8820 
gaatctaccg tgtcttctgc cttggcggag ctcgccacag agaccttcgg cagctccgaa 8880 
tcgtcggccg tcgacagcgg cacggcaacg gcctctcctg accagccctc cgacgacggc 8940 
gacgcgggat ccgacgttga gtcgtactcc tccatgcccc cccttgaggg ggagccgggg 9000 
gatcccgatc tcagcgacgg gtcttggtct accgtaagcg aggaggctag tgaggacgtc 9060 
gtctgctgct cgatgtccta cacatggaca ggcgccctga tcacgccatg cgctgcggag 9120 
gaaaccaagc tgcccatcaa tgcactgagc aactctttgc tccgtcacca caacttggtc 9180 
tatgctacaa catctcgcag cgcaagcctg cggcagaaga aggtcacctt tgacagactg 9240 
caggtcctgg acgaccacta ccgggacgtg ctcaaggaga tgaaggcgaa ggcgtccaca 9300 
gttaaggcta aacttctatc cgtggaggaa gcctgtaagc tgacgccccc acattcggcc 9360
agatctaaat ttggctatgg ggcaaaggac gtccggaacc tatccagcaa ggccgttaac 9420 
cacatccgct ccgtgtggaa ggacttgctg gaagacactg agacaccaat tgacaccacc 9480 
atcatggcaa aaaatgaggt tttctgcgtc caaccagaga aggggggccg caagccagct 9540 
cgccttatcg tattcccaga tttgggggtt cgtgtgtgcg agaaaatggc cctttacgat 9600 
gtggtctcca ccctccctca ggccgtgatg ggctcttcat acggattcca atactctcct 9660 
ggacagcggg tcgagttcct ggtgaatgcc tggaaagcga agaaatgccc tatgggcttc 9720 
gcatatgaca cccgctgttt tgactcaacg gtcactgaga atgacatccg tgttgaggag 9780 
tcaatctacc aatgttgtga cttggccccc gaagccagac aggccataag gtcgctcaca 9840 
gagcggcttt acatcggggg ccccctgact aattctaaag ggcagaactg cggctatcgc 9900 
cggtgccgcg cgagcggtgt actgacgacc agctgcggta ataccctcac atgttacttg 9960 
aaggbcgctg cggcctgtcg agctgcgaag ctccaggact gcacgatgct cgtatgcgga 10020 
gacgaccttg tcgttatctg tgaaagcgcg gggacccaag aggacgaggc gagcctacgg 10080 
gccttcacgg aggctatgac tagatactct gccccccctg gggacccgcc caaaccagaa 10140 
tacgacttgg agttgataac atcatgdtcc tccaatgtgt cagtcgcgca cgatgcatct 10200 
ggcaaaaggg tgtactatct cacccgtgac cccaccaccc cccttgcgcg ggctgcgtgg 10260 
gagacagcta gacacactcc agtcaattcc tggctaggca acatcatcat gtatgcgccc 10320 
accttgtggg caaggatgat cctgatgact catttcttct ccatccttct agctcaggaa 10380 
caacttgaaa aagccctaga ttgtcagatc tacggggcct gttactccat tgagccactt 10440
gacctacctc agatcattca acgactccat ggccttagcg cattttcact ccatagttac 10500 
tctccaggtg agatcaatag ggtggcttca tgcctcagga aacttggggt accgcccttg 10560 
cgagtctgga gacatcgggc cagaagtgtc cgcgctaggc tactgtccca gggggggagg 10620 
gctgccactt gtggcaagta cctcttcaac tgggcagtaa ggaccaagct caaactcact 10680
ccaatcccgg ctgcgtccca gttggattta tccagctggt tcgttgctgg ttacagcggg 10740 
ggagacatat atcacagcct gtctcgtgcc cgaccccgct ggttcatgtg gtgcctactc 10800 
ctactttctg taggggtagg catctatcta ctccccaacc gatgaacggg gagctaaaca 10860
ctccaggcca ataggccatc ctgttttttt cccttttttt ttttcttttt tttttttttt 10920 
tttttttttt tttttttttc tccttttttt ttcctctttt tttccttttc tttcctttgg 10980 
tggctccatc ttagccctag tcacggctag ctgtgaaagg tccgtgagcc gcttgactgc 11040 
agagagtgct gatactggcc tctctgcaga tcaagt 11076 
<210> 10
<211> 8001
<212> DNA

<213> Hepatitis C Virus
<400> 10

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120
cccccctcbc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gc^agactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgdac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 420 
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 480 
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 540 
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 600 
cgtggctggc cacgac^ggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 660
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 720 
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 780 
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 840
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcag^gg ctcgcgccag 900 
cdgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 960 
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 1020 
actgtggccg gctgg^ftgtg gcggaccgct atcaggacat agcgttggct acccgtgata 1080 
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccg 1140 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 1200 
cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 1260 
aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 1320 
tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 1380 
acgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 1440
gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 1500 
tgcaggcagc ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta 1560 
taagatacac ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg 1620 
gaaagagtca aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 1680 
gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 1740
tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1800
cacgataata ccatggcgcc tattacggbc tactcccaac agacgcgagg cctacttggc 1860
tgcatcatca ctagcctcac aggccgggac aggaaccagg tcgaggggga ggtccaagtg 1920
gtctccaccg caacacaatc tttcctggcg acctgcgtca atggcgtgtg ttggactgtc 1980 
tatcatggtg ccggctcaaa gacccttgcc ggcccaaagg gcccaatcac ccaaatgtac 2040 
accaatgtgg accaggacct cgtcggctgg caagcgcccc ccggggcgcg ttccttgaca 2100 
ccatgcacct gcggcagctc ggacctttac ttggtcacga ggcatgccga tgtcattccg 2160 
gtgcgccggc ggggcgacag cagggggagc ctactctccc ccaggcccgt ctcctacttg 2220 
aagggctctt cgggcggtcc actgctctgc ccctcggggc acgctgtggg catctttcgg 2280 
gctgccgtgt gcacccgagg ggttgcgaag gcggtggact ttgtacccgt cgagtctatg 2340 
ggaaccacta tgcggtcccc ggtcttcacg gacaactcgt cccctccggc cgtaccgcag 2400
acattccagg tggcccatct acacgcccct actggtagcg gcaagagcac taaggtgccg 2460
gctgcgtatg caggccaagg gtataaggtg cttgtcctga acccgtccgt cgccgccacc 2520 
ctaggtttcg gggcgtatat gtctaaggca catggtatcg accctaacat cagaatcggg 2580 
gtaaggacca tcaccacggg tgcccccatc acgtactcca cctatggcaa gtttcttgcc 2640 
gacggtggtt gctctggggg cgcctatgac atcataatat gtgatgagtg cdactcaact 2700 
gactcgacca ctatcctggg catcggcaca gtcctggacc aagcggagac ggctggagcg 2760 
cgactcgtcg tgctcgccac cgctacgcct ccgggatcgg tcaccgtgcc acatbcaaac 2820 
atcgaggagg tggctctgtc cagcactgga gaaatcccct tttatggcaa agccatcccc 2880 
atcgagacca tcaagggggg gaggcacctc attttctgcc attccaagaa gaaatgtgat 2940 
gagctcgccg cgaagctgtc cggcctcgga ctcaatgctg tagcatatta ccggggcctt 3000 
gatgtatccg tcataccaac tagcggagac gtcattgtcg tagcaacgga cgctctaatg 3060 
acgggcttta ccggcgactt cgactcagtg atcgactgca atacatgtgt cacccagaca 3120 
gtcgacttca gcctggaccc gaccttcacc attgagacga cgaccgtgcc acaagacgcg 3180 
gtgtcacgct cgcagcggcg aggcaggact ggtaggggca ggatgggcat ttacaggttt 3240 
gtgactccag gagaacggcc ctcgggcatg ttcgattcct cggttctgtg cgagtgctat 3300 
gacgcgggct gtgcttggta cgagctcacg cccgccgaga cctcagttag gttgcgggct 3360 
tacctaaaca caccagggtt gcccgtctgc caggaccatc tggagttctg ggagagcgtc 3420
tttacaggcc tcacccacat agacgcccat ttcttgtccc agactaagca ggcaggagac 3480 
aacttcccct acctggtagc ataccaggct acggtgtgcg ccagggctca ggctccacct 3540 
ccatcgtggg accaaatgtg gaagtgtctc atacggctaa agcctacgct gcacgggcca 3600 
acgcccctgc tgtataggct gggagccgtt caaaacgagg ttactaccac acaccccata 3660
accaaataca tcatggcatg catgtcggct gacctggagg tcgtcacgag cacctgggtg 3720 
ctggtaggcg gagtcctagc agctctggcc gcgtattgcc tgacaacagg cagcgtggtc 3780 
attgtgggca ggatcatctt gtccggaaag ccggccatca ttcccgacag ggaagtcctt 3840
taccgggagt tcgatgagat ggaagagtgc gcctcacacc tcccttacat cgaacaggga 3900 
atgcagctcg cc^aacaatt caaacagaag gcaatcgggt tgctgcaaac agccaccaag 3960 
caagcggagg ctgctgctcc cgtggtggaa tccaagtggc ggaccatcga agccttctgg 4020 
gcgaagcata tgtggaattt catcagcggg atacaatatt tagcaggctt gtccactctg 4080 
cctggcaacc ccgcgatagc atcactgatg gcattcacag cctctatcac cagcccgctc 4140 
accacdcaac ataccctcct gtttaacatc ctggggggat gggtggccgc ccaacttgct 4200 
cctcccagcg ctgcttctgc tttcgtaggc gccggcatcg ctggagcggc tgttggcagc 4260
ataggccttg ggaaggtgct tgtggatatt ttggcaggtt atggagcagg ggtggcaggc 4320
gcgctcgtgg cctttaaggt catgagcggc gagttgccct ccaccgagga cctggttaac 4380
ctactccctg ctatcctctc ccctggcgcc ctagtcgtcg gggtcgtgtg cgcagcgata 4440 
ctgcgtcggc acgtgggccc aggggagggg gctgtgcagt ggat^aaccg gctgatagcg 4500 
ttcgcttcgc ggggtaacca cgtctccccc acgcactatg tgcctgagag cgacgctgca 4560 
gcacgagtca ctcagatcct ccctagtctt accatcactc agctgctgaa gaggcttcac 4620
cagtggatca acgaggactg ctccacgcca tgctccggct cgtggctaag agatgtttgg 4680 
gattggatat gcacggtgtt gactgatttc aagacctggc tccagtccaa gctcctgccg 4740 
cgattgccgg gagtcccctt cttctcatgt caacgtgggt acaagggagt ctggcggggc 4800 
gacggcatca tgcaaaccac ctgcccatgt ggagcacaga tcaccggaca tgtgaaaaac 4860 
ggttccat^a ggatcgtggg gcctaggacc tgtagtaaca cgtggcatgg aacattcccc 4920
attaacgcgt acaccacggg cccctgcacg ccctccccgg cgccaaatta ttctagggcg 4980 
ctgtggcggg taggtgdtga ggagtacgtg gaggttacgc gggtggggga tttccactac 5040 
gtgacgggca tgaccactga caacgtaaag tgcccgtgtc aggttccggc ccccgaattc 5100 
ttcacagaag tggatggggt gcggttgcac aggtacgctc cagcgtgcaa acccctccta 5160 
cgggaggagg tcacattcct ggtcgggctc aatcaatacc tggttgggtc acagctccca 5220 
tgcgagcctg aaccggatgt agcagtgctc acttccatgc tcaccgaccc ctcccacatt 5280 
acggcggaga cggctaagcg taggctggcc aggggatctc cccccccctt ggccagctca 5340 
tcagctagcc agctgtctgc gccttccttg aaggcaacat gcactacccg tcatgactcc 5400
ccggacgctg acctcatcga ggccaacctc ctgtggcggc aggagatggg cgggaacatc 5460 
acccgcgtgg agtcagaaaa taaggtagta attttggact ctttcgagcc gctccaagcg 5520
gaggaggatg agaggggagt atccgttccg gcggagatcc tgcggaggtc caggaaattc 5580
cctcgagcga tgcccatatg ggcacgcccg gattacaacc ctccactgtt agagtcctgg 5640 
aaggacccg^ actacgtccc tccagtggta cacgggtgtc cattgccgcc tgccaaggcc 5700 
cctccgatac caccttcacg gaggaagagg acggttgtcc tgtcagaatc taccgtgtct 5760 
tctgccttgg cggagctcgc cacagagacc ttcggcagct ccgaatcgtc ggccgtcgac 5820 
agcggcacgg caacggcctc tcctgaccag ccctccgacg acggcgacgc gggatccgac 5880 
gttgagtcgt actcctccat gccccccctt gagggggagc cgggggatcc cgatctcagc 5940 
gacgggtctt ggtctaccgt aagcgaggag gctagtgagg acgtcgtctg ctgctcgatg 6000 
tcctacacat ggacaggcgc cctgatcacg ccatgcgctg cggaggaaac caagctgccc 6060 
atcaatgcac tgagcaactc tttgctccgt caccacaact tggtctatgc tacaacatct 6120 
cgcagcgcaa acctgcggca gaagaaggtc acctttgaca gactgcaggt cctggacgac 6180 
cactaccggg acgtgctcaa ggagatgaag gcgaaggcgt ccacagttaa ggctaaactt 6240 
ctatccgtgg aggaagcctg taagctgacg cccccacatt cggccagatc taaatttggc 6300 
tatggggcaa aggacgtccg gaacctatcc agcaaggccg ttaaccacat ccgctccgtg 6360 
tggaaggact tgctggaaga cactgagaca ccaattgaca ccaccatcat ggcaaaaaat 6420
gaggttttct gcgtccaacc agagaagggg ggccgcaagc cagctcgcct tatcgtattc 6480 
ccagatttgg gggttcgtgt gtgcgagaaa atggcccttt acgatgtggt ctccaccctc 6540 
cctcaggccg tgatgggctc ttcatacgga ttccaatact ctcctggaca gcgggtcgag 6600 
ttcctggtga atgcctggaa agcgaagaaa tgccctatgg gcttcgcata tgacacccgc 6660 
tgttttgact caacggtcac tgagaatgac atccgtgttg aggagtcaat ctaccaatgt 6720 
tgtgacttgg cccccgaagc cagacaggcc ataaggtcgc tcacagagcg gctttacatc 6780 
gggggccccc tgactaattc taaagggcag aactgcggct atcgccggtg ccgcgcgagc 6840 
ggtgtactga cgaccagctg cggtaatacc ctcacatgtt acttgaaggc cgctgcggcc 6900 
tgtcgagctg cgaagctcca ggactgcacg atgctcgtat gcggagacga ccttgtcgtt 6960
atctgtgaaa gcgcggggac ccaagaggac gaggcgagcc tacgggcctt cacggaggct 7020 
atgactagat actctgcccc ccctggggac ccgcccaaac cagaatacga cttggagttg 7080 
ataacatcat gctcctccaa tgtgtcagtc gcgcacgatg catctggcaa aagggtgtac 7140 
tatctcaccc gtgaccccac cacccccctt gcgcgggctg cgtgggagac agctagacac 7200 
actccagtca attcctggct aggcaacatc atcatgtatg cgcccacctt gtgggcaagq 7260 
atgatcctga tgactcattt cttctccatc cttctagctc aggaacaact tgaaaaagcc 7320 
ctagattgtc agatctacgg ggcctgttac tccattgagc cacttgacct acctcagatc 7380 
attcaacgac tccatggcct tagcgcattt tcactccata gttactctcc aggtgagatc 7440 
aatagggtgg cttcatgcct caggaaactt ggggtaccgc ccttgcgagt ctggagacat 7500 
cgggccagaa gtgtccgcgc taggctactg tcccaggggg ggagggctgc cacttgtggc 7560 
aagtacctct tcaactgggc agtaaggacc aagctcaaac tcactccaat cccggctgcg 7620 
tcccagttgg atttatccag ctggttcgtt gctggttaca gcgggggaga catatatcac 7680
agcctgtctc gtgcccgacc ccgctggttc atgtggtgcc tactcctact ttctgtaggg 7740
gtaggcatct atctactccc caaccgatga acggggagct aaacactcca ggccaatagg 7800
ccatcctgtt tttttccctt tttttttttc tttttttttt tttttttttt tttttttttt 7860
ttttctcctt tttttttcct ctttttttcc ttttctttcc tttggtggct ccatcttagc 7920
cctagtcacg gctagctgtg aaaggtccgt gagccgcttg actgcagaga gtgctgatac 7980
tggcctctct gcagatcaag t 8001
<210> 11
<211> 11076
<212> DNA

<213> Hepatitis C Virus
<400> 11

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacg ggcgcgccat gattgaacaa gatggattgc 420 
acgcaggttc tccggccgct tgggtggaga ggctattcgg ctatgactgg gcacaacaga 480 
caatcggctg ctctgatgcc gccgtgttcc ggctgtcagc gcaggggcgc ccggttcttt 540 
ttgtcaagac cgacctgtcc ggtgccctga atgaactgca ggacgaggca gcgcggctat 600 
cgtggctggc cacgacgggc gttccttgcg cagctgtgct cgacgttgtc actgaagcgg 660 
gaagggactg gctgctattg ggcgaagtgc cggggcagga tctcctgtca tctcaccttg 720
ctcctgccga gaaagtatcc atcatggctg atgcaatgcg gcggctgcat acgcttgatc 780 
cggctacctg cccattcgac caccaagcga aacatcgcat cgagcgagca cgtactcgga 840 
tggaagccgg tcttgtcgat caggatgatc tggacgaaga gcatcagggg ctcgcgccag 900 
ccgaactgtt cgccaggctc aaggcgcgca tgcccgacgg cgaggatctc gtcgtgaccc 960 
atggcgatgc ctgcttgccg aatatcatgg tggaaaatgg ccgcttttct ggattcatcg 1020 
actgtggccg gctgggtgtg gcggaccgct atcaggacat agcgttggct acccgtgata 1080
ttgctgaaga gcttggcggc gaatgggctg accgcttcct cgtgctttac ggtatcgccq 1140 
ctcccgattc gcagcgcatc gccttctatc gccttcttga cgagttcttc tgagtttaaa 1200 
cagaccacaa cggtttccct ctagcgggat caattccgcc cctctccctc ccccccccct 1260 
aacgttactg gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt 1320 
tccaccatat tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg 1380 
acgagcattc ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc 1440 
gtgaaggaag cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt 1500 
tgcaggcagc ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta 1560 
taagatacac ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg 1620 
ZiVtfZ* aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag 1680 
gtaccccatt gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag 1740 
tcgaggttaa aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa 1800 
cacgataata ccatgggcac gaatcctaaa cctcaaagaa aaaccaaacg taacaccaac 1860 
cgccgcccac aggacgtcaa gttcccgggc ggtggtcaga tcgtcggtgg agtttacctg 1920 
Itl^r 9 ? ^ggccccag gttgggtgtg cgcgcgacta ggaagacttc cgagcggtc| 1980 
caacctcgtg gaaggcgaca acctatcccc aaggctcgcc agcccgaggg tagggcctgg 2040 
^f 9 ^ 09 ggtacccctg gcccctctat ggcaatgagg gcttggggtg ggcaggatgg 2100 
ctcctgtcac cecgtggctc tcggcctagt tggggcccca cggacccccg gdgtlggtcg 2160 
cgcaatttgg gtaaggtcat cgataccctc acgtgcggct tcgccgatct catggggtal 2220 
attccgctcg tcggcgcccc cctagggggt: gctgccaggg ccctggcgca tggcgtccgg 2280 
g "^ 9g f 9g acggcgtgaa ctatgcaaca gggaatctgc ccggttgctc cttttctatl 2340 
itttlllall 9Ct f I ^tttgafcc-'atcdcagctt ccgcttatga agtgcgcaac 2400 
gtatccggag tgtaccatgt cacgaacgac tgctccaacg caagcattgt gtatgaggca 2460 
ItlllttlTr. ^ Cat9C f taC ccc ^9tga gtgccctgcg ttcgggagaa caactcctcc 2520 
tagcgctcac tcccacgctc gcggccagga aogctagcgt ccccactacg 2580 
SSSJSJ I C 5 a ? 9tCiga tt:t 9<=tcgtt ggggcggctg ctctctgctc cgctatgtal 2640 
11^-111° tct r 9gatC tgtttt6 °^c gtcgcccagc tgttcacctt ctcgcc£cgc 2700 
cggcacgaga cagtacagga ctgcaattgc tcaatatatc ccggccacgt gacaggtcac 2760 
tllltlllll gggatatgat gatgaactgg tcacctacag cagccctagt ggtatdgcag 2820 
ItZZZlltE ^r^ 33 ? 0 ^ tc ^ at atggtggcgg gggcccattg gggagtcctl 2880 
g ? ggg ^" 9 ccta ctattc catggtgggg aactgggcta aggttctgat tg£gatgcta 2940 
c C ^"^ a g gcgttgacgg gggaacctat gtgacagggg ggacgatggc caaaaacacc 3000 
millt I c g tcct =tctt ttcacccggg tcatcccaga aaatccagct tgtaaacacc 3060 
HlltT q< : " C ? CatCaa caggactgcc ctgaactgca atgactccct caacactggg 3120 
ttccttgctg cgctgttcta cgtgcacaag ttcaactcat ctggatgccc agagcgcatq 3180 
gccagctgca gccccatcga cgcgttcgct caggggtggg ggcccatcac ttacaatgag 3240 
£llJ^l CggaCCagag gccttattgt tggcactacg caccccggcc gtgcggtatc 3300 
gtacccgcgg cgcaggtgtg tggtccagtg tactgcttca ccccaagccc tgtcgtggtg 3360 
ggga ° ga ^ g accggttcgg cgtccctacg tacagttggg gggagaatga gacggacgtg 3420 
fnlfJtl I acaacacgcg gccgccgcaa ggcaactggt ttggctgtac atggatgaat 3480 
agcactgggt tcaccaagac gtgcgggggc cccccgtgta acatcggggg gatcggcaat 3540 
aaaaccttga cctgccccac ggactgcttc cggaagcacc ccgag?ccac ?tacaccaag 3600 
tgtggttcgg ggccttggtt gacacccaga tgcttggtcc actacccata caggctttgg 3660
cactacccct gcactgtcaa ctttaccatc ttcaaggtta ggatgtacgt ggggggagtg 3720 
gagcacaggc tcgaagccgc atgcaattgg actcgaggag agcgttgtaa cctggaggac 3780 
agggacagat cagagcttag cccgctgctg ctgtctacaa cggagtggca ggtattgccc 3840 
tgttccttca ccaccctacc ggctctgtcc actggtttga tccatctcca tcagaacgtc 3900 
gt^gacgtac aatacctgta cggtataggg tcggcggttg tctcctttgc aatcaaatgg 3960 
gagtatgtcc tgttgctctt ccttcttctg gcggacgcgc gcgtctgtgc ctgcttgtgg 4020
atgatgctgc tgatagctca agctgaggcc gccctagaga acctggtggt cctcaacgcg 4080
gcatccgtgg ccggggcgca tggcattctc tccttcctcg tgttcttctg tgctgcctgg 4140 
tacatcaagg gcaggctggt ccctggggcg gcatatgccc tctacggcgt atggcdgcta 4200 
ctcctgctcc tgctggcgtt accaccacga gcatacgcca tggaccggga gatggcagca 4260
tcgtgcggag gcgcggtttt cgtaggtctg atactcttga ccttgtcacc gcactataag 4320 
ctgttcctcg ctaggctcat atggtggtta caatatttta tcaccagggc cgag^cacac 4380
ttgcaagtgt ggatcccccc cctcaacgtt cgggggggcc gcgatgccgt catcctcctc 4440
acgtgcgcga tccacccaga gctaatcttt accatcacca aaatcttgct cgccatactc 4500
ggtccactca tggtgctcca ggctggtata accaaagtgc cgtacttcgt gcgcgcacac 4560
gggctcattc gtgcatgcat gctggtgcgg aaggttgctg ggggtcatta tgtccaaatg 4620
gctctcatga agttggccgc actgacaggt acgtacgttt atgaccatct caccccactg 4680
cgggactggg cccacgcggg cctacgagac cttgcggtgg cagttgagcc cgtcgtcttc 4740
tctgatatgg agaccaaggt tatcacctgg ggggcagaca ccgcggcgtg tggggacatc 4800 
attttgggcc tgcccgtctc cgcccgcagg gggagggaga tacatctggg accggcagac 4860 
agccttgaag ggcaggggtg gcgactcctc gcgcctatta cggdctactc ccaacagacg 4920
cgaggcctac ttggctgcat catcactagc ctcacaggcc gggacaggaa ccaggtcgag 4980 
ggggaggtcc aagtggtctc caccgcaaca caatctttcc tggcgacctg cgtcaatggc 5040
gtgtgttgga ctgtctatca tggtgccggc tcaaagaccc ttgccggccc aaagggccca 5100
atcacccaaa tgtacaccaa tgtggaccag gacctcgtcg gctggcaagc gccccccggg 5160 
gcgcgttcct tgacaccatg cacctgcggc agctcggacc tttacttggt cacgaggcat 5220 
gccgatgtca ttccggtgcg ccggcggggc gacagcaggg ggagcctact ctcccccagg 5280
cccgtctcct acttgaaggg ctcttcgggc ggtccactgc tctgcccctc ggggcacgct 5340 
gtgggcatct ttcgggctgc cgtgtgcacc cgaggggttg cgaaggcggt ggactttgta 5400 
cccgtcgagt ctatgggaac cactatgcgg tccccggtct tcacggacaa ctcgtcccct 5460
ccggccgtac cgcagacatt ccaggtggcc catctacacg cccctactgg tagcggcaag 5520 
agcactaagg tgccggctgc gtatgcaggc caagggtata aggtgcttgt cctgaacccg 5580
tccgtcgccg ccaccctagg tttcggggcg tatatgtcta aggcacatgg tatcgaccct 5640 
aacatcagaa tcggggtaag gaccatcacc acgggtgccc ccatcacgta ctccacctat 5700 
ggcaagtttc ttgccgacgg tggttgctct gggggcgcct atgacatcat aatatgtgat 5760 
gagtgccact caactgactc gaccactatc ctgggcatcg gcacagtcct ggaccaagcg 5820 
gagacggctg gagcgcgact cgtcgtgctc gccaccgcta cgcctccggg atcggtcacc 5880 
gtgccacatc caaacatcga ggaggtggct ctgtccagca ctggagaaat ccccttttat 5940 
ggcaaagcca tccccatcga gaccatcaag ggg^rggaggc acctcatttt ctgccattcc 6000 
aagaagaaat gtgatgagct cgccgcgaag ctgtccggcc tcggactcaa tgctgtagca 6060 
tattaccggg gccttgatgt atccgtcata ccaactagcg gagacgtcat tgtcgtagca 6120 
acggacgctc taatgacggg ctttaccggc gacttcgact cagtgatcga ctgcaataca 6180
tgtgtcaccc agacagtcga cttcagcctg gacccgacct tcaccattga gacgacgacc 6240 
gtgccacaag acgcggtgtc acgctcgcag cggcgaggca ggactggtag gggcaggatg 6300 
ggcatttaca ggtttgtgac tccaggagaa cggccctcgg gcatgttcga ttcctcggtt 6360
ctgtgcgagt gctatgacgc gggctgtgct tggtacgagc tcacgcccgc cgagacctca 6420
gttaggttgc gggcttadct aaacacacca gggttgcccg tctgccagga ccatctggag 6480
ttctgggaga gcgtctttac aggcctcacc cacatagacg cccatttctt gtcccagact 6540 
aagcaggcag gagadaactt cccctacctg gtagcatacc aggctacggt gtgcgccagg 6600 
gctcaggctc cacctccatc gtgggaccaa atgtggaagt gtctcatacg gctaaagcct 6660 
acgctgcacg ggccaacgcc cctgctgtat aggctgggag ccgttcaaaa cgaggttact 6720 
accacacacc ccataaccaa atacatcatg gcatgcatgt cggctgacct ggaggtcgtc 6780 
acgagcacct gggtgctggt aggcggagtc ctagcagctc tggccgcgta ttgcctgaca 6840 
acaggcagcg tggtcattgt gggcaggatc atcttgtccg gaaagccg^c catcattccc 6900 
gacagggaag tcctttaccg ggagttcgat gagatggaag agtgcgcctc acacctccct 6960 
tacatcgaac agggaatgca gctcgccgaa caattcaaac agaaggcaat cgggttgctg 7020 
caaacagcca ccaagcaagc ggaggctgct gctcccgtgg tggaatccaa gtggcggacc 7080
atcgaagcct tctgggcgaa gcatatgtgg aatttcatca gcgggataca atatttagca 7140 
ggcttgtcca ctctgcctgg caaccccgcg atagcatcac tgatggcatt cacagcctct 7200 
atcaccagcc cgctcaccac ccaacatacc ctcctgttta acatcctggg gggatgggtg 7260 
gccgcccaac ttgctcctcc cagcgctgct tctgctttcg taggcgccgg catcgctgga 7320 
gcggctgttg gcagcatagg ccttgggaag gtgcttgtgg atattttggc aggttatgga 7380 
gcaggggtgg caggcgcgct cgtggccttt aaggtcatga gcggcgagtt gccctccacc 7440 
gaggacctgg ttaacctact ccctgctatc ctctcccctg gcgccctagt cgtcggggtc 7500 
gtgtgcgcag cgatactgcg tcggcacgtg ggcccagggg agggggctgt gcagtggatg 7560 
aaccggctga tagcgttcgc ttcgcggggt aaccacgtct cccccacgca ctatgtgcct 7620 
gagagcgacg ctgcagcacg agtcactcag atcctctcta gtcttaccat cactcagctg 7680
ctgaagaggc ttcaccagtg gatcaacgag gactgctcca cgccatgctc cggctcgtgg 7740
ctaagagatg tttgggattg gatatgcacg gtgttgactg atttcaagac ctggctccag 7800 
tccaagctcc tgccgcgatt gccgggagtc cccttcttct catgtcaacg tgggtacaag 7860 
ggagtctggc ggggcgacg^ catcatgcaa accacctgcc catgtggagc acagatcacc 7920 
g^acatgtga aaaacggttc catgaggatc gtggggccta ggacctgtag taacacgtgg 7980 
catggaacat tccccattaa cgcgtacacc acgggcccct gcacgccctc cccggcgcca 8040
aattattcta gggcgctgtg gcgggtaggt gctgaggagt acgtggaggt tacgcgggtg 8100 
ggggatttcc actacgtgac gggcatgacc actgacaacg taaagtgccc gtgtcaggtt 8160 
ccggcccccg aattcttcac agaagtggat ggggtgcggt tgcacaggta cgctccagcg 8220 
tgcaaacccc tcctacggga ggaggtcaca ttcctggtcg ggctcaatca atacctggtt 8280 
gggtcacagc tcccatgcga gcctgaaccg gatgtagcag tgctcacttc catgctcacc 8340
gacccctccc acattacggc ggagacggct aagcgtaggc tggccagggg atctcccccc 8400
cccttggcca gctcatcagc tagccagctg tctgcgcctt ccttgaaggc aacatgcact 8460
acccgtcatg actccccgga cgctgacctc atcgag^cca acctcctgtg gcggcaggag 8520 
atgggcggga aGatcacccg cgtggagtca gaaaataagg tagtaatttt ggactctttc 8580 
gagccgctcc aagcggagga ggatgagagg ggagtatccg ttccggcgga gatcctgcgg 8640 
aggtccagga aattccctcg agcgatgccc atatgggcac gcccggatta caaccctcca 8700 
ctgttagagt cctggaagga cccggactac gtccctccag tggtacacgg gtgtccattg 8760 
ccgcctgcca aggcccctcc gataccacct tcacggagga agaggacggt tgtcctgtca 8820 
gaatctaccg tgtcttctgc cttggcggag ctcgccacag agaccttcgg cagctccgaa 8880 
tcgtcggccg tcgabagcgg cacggcaacg gcctctcctg accagccctc cgacgacggc 8940
gacgcgggat ccgacgttga gtcgtactcc tccatgcccc cccttgaggg ggagccgggg 9000
gatcccgatc tcagcgacgg gtcttggtct accgtaagcg aggaggctag tgaggacgtc 9060
gtctgctgct cgatgtccta cacatggaca ggcgccctga tcacgccatg cgctgirggag 9120 
gaaaccaagc tgcccatcaa tgcactgagc aactctttgc tccgtcacca caacttggtc 9180 
tatgctacaa catctcgcag cgcaaacctg cggcagaaga aggtcacctt tgacagactg 9240 
caggtcctgg acgaccacta ccgggacgtg ctcaaggaga tgaaggcgaa ggcgtccaca 9300 
gttaaggcta aacttctatc cgtggaggaa gcctgtaagc tgacgccccc acattcggcc 9360 
agatctaaat ttggctatgg ggcaaaggac gtccggaacc tatccagcaa ggccgttaac 9420 
cacatccgct ccgtgtggaa ggacttgctg gaagacactg agacaccaat tgacaccacc 9480 
atcatggcaa aaaatgaggt tttctgcgtc caaccagaga aggggggccg caagccagct 9540 
cgccttatcg tattcccaga tttgggggtt cgtgtgtgcg agaaaatggc cctttacgat 9600 
gtggtctcca ccctccctca ggccgtgatg ggctcttcat acggattcca atactctcct 9660 
ggacagcggg tcgagttcct ggtgaatgcc tggaaagcga agaaatgccc tatgggcttc 9720 
gcatatgaca cccgctgttt tgactcaacg gtcactgaga atgacatccg tgttgaggag 9780 
tcaatctacc aatgttgtga cttggccccc gaagccagac aggccataag gtcgctcaca 9840 
gagcggcttt acatcggggg ccccctgact aattctaaag ggcagaactg cggctatcgc 9900 
cggtgccgcg cgagcggtgt actgacgacc agctgcggta ataccctcac atgttacttg 9960 
aaggccgctg cggcctgtcg agctgcgaag ctccaggact gcacgatgct cgtatgcgga 10020 
gacgaccttg tcgttatctg tgaaagcgcg gggacccaag aggacgaggc gagcctacgg 10080 
gccttcacgg aggctatgac tagatactct gccccccctg gggacccgcc caaaccagaa 10140 
tacgacttgg agttgataac atcatgctcc tccaatgtgt cagtcgcgca cgatgcatct 10200 
ggcaaaaggg tgtactatct cacccgtgac cccaccaccc cccttgcgcg ggctgcgtgg 10260 
gagacagcta gacacactcc agtcaattcc tggctaggca acatcatcat gtatgcgccc 10320 
accttgtggg caaggatgat cctgatgact catttcttct ccatccttct agctcaggaa 10380 
caacttgaaa aagccctaga ttgtcagatc tacggggcct gttactccat tgagccactt 10440
gacctacctc agatcattca acgactccat ggccttagcg cattttcact ccatagttac 10500 
tctccaggtg agatcaatag g^tggcttca tgcctcagga aacttggggt accgcccttg 10560 
cgagtctgga gacatcgggc cagaagtgtc cgcgctaggc tactgtccca gggggggagg 10620 
gctgccactt gtggcaagta cctcttcaac tgggcagtaa ggaccaagct caaactcact 10680 
ccaatcccgg ctgcgtccca gttggattta tccagctggt tcgttgctgg ttacagcggg 10740 
ggagacatat atcacagcct gtctcgtgcc cgaccccgct ggttcatgtg gtgcctactc 10800 
ctactttctg taggggtagg catctatcta ctccccaacc gatgaacggg gagctaaaca 10860 
ctccaggcca atag^ccatc ctgttttttt cccttttttt ttttcttttt tttttttttt 10920 
tttttttttt tttttttttc tccttttttt ttcctctttt tttccttttc tttcctttgg 10980 
tggctccatc ttagccctag tcacggctag ctgtgaaagg tccgtgagcc gcttgactgc 11040
agagagtgct gatactggcc tctctgcaga tcaagt 11076
Fehlner, Paul

Tuesday, September 30, 2003 8:06 AM 
'Michael Sasaki, Esq. (msasaki@anadyspharma.com)'
Vainberg, Irina 

Hepatitis C Virus Constructs ... (PCT/US01/46350; our file 00342/200h395-WO0)
Michael: 

In the absence of instructions to the contrary, as requested in our email of September 26, we have instructed the
Authorized Office to go ahead with issuing an International Preliminary Examination Report. We will forward the report to
you in due course.

Best regards, Paul 

Paul F. Fehlner, Ph.D. 
Darby & Darby P.C. 
805 Third Avenue 
New York, NY 10022 

(212) 527-7665 | direct
(212) 753-6237 | fax 

http://www.darbylaw.com